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invention, including patents, patent applications and various publications, as well as 
biological sequences (e.g., nucleic acid and polypeptide sequences) that are publicly 
available through various databases such as the GenBank database. The citation and/or 
discussion of such references is provided merely to clarify the description of the present 
invention and is not an admission that any such reference is "prior art" to the invention 
described herein. All references cited and discussed in this specification are incorporated 
herein by reference in their entirety and to the same extent as if each reference was 
individually incorporated by reference. 



used to diagnose and treat neuropsychiatric disorders, including schizophrenia, 
schizoaffective disorder, bipolar disorder, unipolar affective disorder and adolescent conduct 
disorder. 

The invention also relates to particular genes, referred to as DISCI and 
DISC2, and their gene products. These genes are demonstrated herein to be associated with 



Numerous references are cited and discussed in the description of the 



1. field of the invention 



The present invention relates to compositions and methods which may be 



neuropsychiatry disorders (including schizophrenia, schizoaffective disorder, bipolar 
disorder, unipolar affective disorder and adolescent conduct disorder) and are therefore 
useful in the methods of the present invention. 

The invention further relates to particular polymorphisms of the genes DISC 1 
5 and DISC2, including particular single nucleotide polymorphisms (SNPs). These novel 
polymorphisms are demonstrated here to co-segregate with neuropsychiatric disorders 
(particularly disorders such as schizophrenia, schizoaffective disorder, bipolar disorder, 
unipolar affective disorder and adolescent conduct disorder). The polymorphisms are useful, 
therefore, in novel methods of the invention for diagnosing and treating these disorders. 

10 

2. BACKGROUND OF THE INVENTION 

There are only a few psychiatric disorders in which clinical manifestations 
of the disorder can be correlated with demonstrable defects in the structure and/or function 
of the nervous system. The vast majority of psychiatric disorders, however, involve subtle 

15 and/or undetectable changes and the cellular and molecuJar levels of nervous system 
structure and function. This lack of discernable neurological defects distinguishes 
"neuropsychiatric disorders" such as schizophrenia, attention deficit disorder (ADD), 
schizoaffective disorder, bipolar affective disorders and unipolar affective disorder from 
neurological disorders in which anatomical or biochemical pathologies are manifest. Hence, 

20 identification of causative defects in neuropathologies of neuropsychiatric disorders is 
needed so that clinicians may diagnose, evaluate and prescribe appropriate treatments for 
these disorders. 

Schizophrenia is one example of a particularly serious and debilitating 
neuropsychiatric disorder that affects approximately 1% of the worldwide population. 
25 Currently, individuals may be evaluated for schizophrenia and other neuropsychiatric 
disorders using the criteria set forth in the most recent version of the American Psychiatric 
Association's Diagnostic and Statistical Manual of Mental Disorders (DSM-IV). 

There is compelling evidence from family, twin and adoption studies for a 
significant genetic basis to schizophrenia and other neuropsychiatric disorders (McGuffin 



et al , Lancet 1 995, 346:678-682). This has initiated searches directed towards identification 
of the genetic component or components of neuropsychiatric disorders using such methods 
as linkage analysis, association studies of candidate genes and mapping of cytogenetic 
abnormalities in psychiatric patients. However, while such techniques have been applied 
5 successfully to monogentic disorders, neuropsychiatric disorders apparently result from 
combined effects of multiple genes and environmental factors (see, McGuffin et al , supra). 
Such effects have complicated efforts to identify genetic components for these diseases. 

Recent studies of a large Scottish family have identified a balanced 
(1;1 1)(q42.1;ql4.3) translocation which segregates with major mental illness (see, St Clair 

10 et al, Lancet 1990, 336:13-16; Millar et al, Psychiatric Genet. 1998, 8:175-181). 
Sequencing of the chromosome 1 breakpoint of this translocation has revealed two genes, 
referred to as Disrupted-In-Schizophrenia 1 and 2, or DISCI (GenBank Accession No. 
AF222980) and DISC2 (GenBank Accession No. AF222981), respectively. The genes 
DISCI and DISC2 are directly disrupted by the translocation, suggesting that they should 

15 be considered formal candidate genes for susceptibility to neuropsychiatric disorders, 
particularly disorders such as schizophrenia, schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder and adolescent conduct disorder (Millar et al, Human 
Molecular Genetics 2000, 9:1415-1423). 

It has been suggested, however, that the Scottish family in which DISC 1 and 

20 DISC2 are disrupted may be atypical of other families and individuals affected by 
neuropsychiatric disorders such as schizophrenia, (see, Millar et al, Human Molecular 
Genetics2000, 9:1415-1423). In particular, underlying genetic defects in DISCI andDISC2 
which are associated with neuropsychiatric disorders in other families or individuals are, 
most likely, different from the translocation observed for the particular Scottish family 

25 described by Millar et al 

There continues to exist, therefore, a need to identify specific genetic defects, 
mutations and polymorphisms that are associated with neuropsychiatric disorders such as 
schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar affective disorder 
and adolescent conduct disorder. 



There further exists a need for compositions and methods to treat and/or 
diagnose these and other neuropsychiatry disorders, e.g., by identifying and correcting 
specific genetic defects, mutations and polymorphisms that are associated with such 
neuropsychiatric disorders. 
5 There also continues to exist a need to identify specific genetic defects, 

mutations, and polymorphisms of the genes DISCI and DISC2 that are associated with 
neuropsychiatric disorders such as schizophrenia, schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder and adolescent conduct disorder. 

There further exists a need for compositions and methods to treat and/or 
1 0 diagnose these and other neuropsychiatric disorders using specific genetic defects, mutations 
and polymorphisms of the genes DISCI and DISC2. 



3. SUMMARY OF THE INVENTION 

The present invention overcomes these and other problems in the art by 
15 providing novel compositions and methods that may be used to diagnose and treat a 
neuropsychiatric disorder such as, but not limited to, schizophrenia, schizoaffective disorder, 
bipolar affective disorder, unipolar affective disorder and adolescent conduct disorder. For 
example, in one embodiment the invention provides novel nucleic acid molecules which are 
associated with a neuropsychiatric disorder. These novel nucleic acid molecules are ones 
20 that correspond to genes known in the art as M Disrupted-in-Schizophrenia 1 " (DISCI) and 
"Disrupted-in-Schizophrenia T (D1SC2), respectively. The novel nucleic acid molecules 
of the invention are ones which contain one or more polymorphisms or variant nucleic acid 
sequences that are associated with a neuropsychiatric disorder such as schizophrenia, 
schizoaffective disorder, bipolar affective disorder, unipolar affective disorder and adolescent 
25 disorder. In preferred embodiments the one or more polymorphisms or variant nucleic acid 
sequences comprise one or more single nucleotide polymorphisms (SNPs). 

For example, in one embodiment the invention provides a nucleic acid 
molecule comprising the nucleotide sequence set forth in FIGS. 1A-C (SEQ ID NO:l) and 
containing one or more single nucleotide polymorphisms. In one specific aspect of this 



embodiment, the nucleotide sequence comprises one or more of the nucleotide substitutions 
set forth in TABLE 5 shown in the Examples, infra. In another embodiment, the invention 
provides nucleic acid molecules comprising any of the nucleotide sequences set forth in 
TABLE 3 shown in the Examples, infra, and containing one or more single nucleotide 
5 polymorphisms. In a specific aspect of this embodiment, the nucleotide sequence comprises 
one or more of the nucleotide substitutions set forth in TABLE 5 shown in the Examples, 
infra. 

The invention also provides gene products that are encoded by the DISC 1 and 
DISC2 nucleic acid molecules of the invention. In one preferred embodiment, the invention 

10 provides a polypeptide having the amino acid sequence shown in FIG. 2 (SEQ ID NO:2) 
with one or more amino acid substitutions. For example, in specific aspects of this 
embodiment the polypeptide comprises the amino acid sequence shown in FIG. 2 (SEQ ID 
NO: 2) with one or more of the specific amino acid substitutions set forth in TABLE 6B 
shown in the Examples, infra. 

15 The invention additionally provides antibodies the specifically bind to a 

DISCI or DISC2 gene product of the invention. For example, the invention provides 
antibodies that specifically bind to a polypeptide having the amino acid sequence shown in 
FIG. 2 (SEQ ID NO:2) and one or more of the specific amino acid substitutions set forth in 
TABLE 6B shown in the Examples, infra. In a particular, preferred aspect of this 

20 embodiment, the antibody is a monoclonal antibody. 

In other embodiments, the invention provides novel methods for identifying 
individual who have a neuropsychiatric disorder or who have a propensity for a 
neuropsychiatric disorder. In one embodiment, the invention provides a method for 
identifying an individual who has a neuropsychiatric disorder or a propensity for a 

25 neuropsychiatric disorder by detecting, in a biological sample obtained from an individual, 
the presence of one the specific DISCI or DISC2 nucleic acid molecules of the invention. 
Detection of the specific DISCI or DISC2 nucleic acid molecule in the biological sample 
identifies the individual as one who has a neuropsychiatric disorder or who has a propensity 
for a neuropsychiatric disorder. In one exemplary aspect of this embodiment, the nucleic 



acid molecule detected is a nucleic acid molecule comprising the sequence set forth in FIGS. 
1A-C (SEQ ID NO:l) and having one or more of the nucleotide substitutions set forth in 
TABLE 6 A shown in the Examples, infra. In another exemplary aspect of this embodiment, 
the nucleic acid molecule detected is a nucleic acid molecule comprising a sequence selected 
5 from those set forth in TABLE 3 shown in the Examples, infra, and having one or more of 
the nucleotide substitutions set forth in TABLE 5 shown in the Examples, infra. 

In an alternative embodiment, the invention provides a method for identifying 
an individual who has a neuropsychiatric disorder or a propensity for a neuropsychiatric 
disorder by detecting, in a biological sample obtained from an individual, the presence of a 

10 specific DISCI or DISC2 gene product (e.g , a polypeptide) of the invention. Detection of 
the specific DISCI or DISC2 gene product in the biological sample identifies the individual 
as one who has a neuropsychiatric disorder or who has a propensity for a neuropsychiatric 
disorder. In one particular aspect of this embodiment, the gene product detected is a 
polypeptide comprising the amino acid sequence set forth in FIG. 2 and having one or more 

15 of the amino acid substitutions set forth in TABLE 6B, shown in the Examples, infra. In 
other, exemplary, aspects of this embodiment, the polypeptide is detected by contacting, to 
the biological sample, an antibody that specifically binds to the polypeptide, and detecting 
the binding of this antibody to the polypeptide in the sample. 

20 4. BRIEF DESCRIPTION OF THE DRAWINGS 

FIGS. 1A-C sets forth the nucleotide sequence of the human DISCI cDNA 
(SEQ ID NO:l) given in GenBank Accession No. AF222980.1 (GE8163868). 

FIG. 2 shows the predicted amino acid sequence (SEQ ID NO: 2) encoded 
25 by the human DISCI cDNA set forth in FIGS. 1A-C(SEQ IDNO:l). Amino acid residues 
of the predicted DISCI polypeptide which are predicted to form a-helices are underlined, 
whereas a predicted globular domain of the DISCI polypeptide is indicated by bold-faced 
type. Alternatively spliced amino acid residues of the DISCI polypeptide are italicized. 
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FIGS. 3A-G sets forth the partial nucleotide sequence of the human DISC2 
cDNA given in GenBank Accession No. AF222981 (GI:8 163870) and presented, here, in 
SEQ ID NO: 3. 

5 FIG. 4 describes primer sequences (SEQ ID NOS: 44-127) for identifying 

and/or amplifying the DISCI and/or DISC2 polymorphisms of the invention (see Example 
3, infra). 

5. DETAILED DESCRIPTION OF THE INVENTION 

1 0 The present invention relates to a gene known as Disrupted-In-Schizophrenia 

1 or DISCI (GenBank Accession No. AF222980; GI:8163868) and to its gene product. The 
invention also relates to a gene known as Disrupted-In-Schizophrenia 2 or DISC2 (GenBank 
Accession No. AF222981; GI:8163870). Both DISCI and DISC2 have been previously 
described. See, in particular, Millar et aL, Human Molecular Genetics 2000, 9:1415-1423. 

1 5 The DISC 1 cDNA sequence (SEQ ID NO: 1 ) is set forth here in FIGS. 1 A C 

The complete cDNA sequence comprises a sequence of 6913 nucleotides (nucleotides 1- 
69 1 3 of SEQ ID NO: 1 ) and a poly(A) tail. The sequence further comprises a protein coding 
sequence 2565 nucleotides in length that begins with the ATG codon at nucleotide position 
54 in FIGS. 1A-C (SEQ ID NO:l) and ends at the STOP codon located at position 2616 in 

20 FIGS. 1A-C (SEQ ID NO: 1). The DISCI cDNA sequence set forth in FIGS. 1A-C and in 
SEQ ID N 0:1 also contains a 3'-untranslated region (UTR) of about 4294 nucleotides that 
begins after the STOP codon of the protein coding sequence (i.e., at about nucleotide 
position 2619 in SEQ ID NO:l). A common splice variant of the DISCI cDNA sequence 
depicted in FIGS. 1A-C and in SEQ ID NOT is also known to exist. This splice variant 

25 comprises a sequence which is essentially identical to the DISCI cDNA sequence set forth 
in FIGS. 1A-C and in SEQ ID NO:l, except that the nucleotide sequence corresponding to 
nucleotide positions 2295-2360 is omitted from this splice variant. 

The coding sequence of the DISCI cDNA set forth in FIGS. 1A-C and in 
SEQ ID NOT encodes a predicted DISCI polypeptide having the amino acid sequence 
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shown in FIG. 2 and set forth in SEQ ID NO:2. This DISCI polypeptide may be divided 
into at least two regions of distinct secondary structure. The first region, which is also 
referred to as the N-terminal region, comprises approximately amino acid residues 1 -347 of 
SEQ ID NO: 1 and has been predicted to comprise at least one globular domain. The second 
5 region of secondary structure is also referred to as the C-terminal domain and comprises 
approximately amino acid residues 348-854 of SEQ ID NO:2. The C-terminal comprises at 
least three a-helix regions which are indicated by bold underlining in FIG. 2. These a- 
helices correspond to approximately amino acid residues 367-394, 452-500 and 602-630 of 
SEQ ID NO:2. It is understood that the a-helices contained in the DISC 1 C-terminal region 

10 may interact with each other to form coiled coil structures. It is understood that the 
nucleotide and amino acid residue positions specified to delineate all regions and domains 
of the DISCI gene and gene products are approximate. 

As noted above ? a splice variant of the DISCI cDNA sequence set forth in 
FIGS. 1A-C (SEQ ID NO:l) is also known to exist This DISCI splice variant is predicted 

15 to encoded a polypeptide sequence that is essentially identical to the DISC 1 polypeptide set 
forth in FIG. 2 (SEQ ID NO:2). However, the sequence corresponding to amino acid 
residues 748-768 of SEQ ID NO:2 is omitted from this DISCI polypeptide splice variant. 

The complete genomic DNA sequence for DISCI has not been produced. 
However, the complete sequence may be partially represented by partial genomic sequences, 

20 provided herein in TABLE 3 (see, Examples, infra, and SEQ ID NO: 4). The DISCI 
genomic sequence comprises at least thirteen exons, which may be transcribed and spliced 
together to form a DISCI mRNA. The boundaries of these exons/introns and their location 
in the partial genomic sequences are provided in TABLE 4 (see, Examples, infra, and SEQ 
IDNOS:5-32). 

25 A partial DISC2 cDNA sequence (GenBank Accession No. AF222981, 

GI:8163870) is set forth in FIGS. 3A-G (SEQ ID NO:3). The genomic sequence of the 
human DISC2 gene is believed to overlap on human chromosome 1 with the sequence for 
DISCI, but is transcribed in the opposite direction. The DISC2 cDNA set forth in FIGS. 
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3A-G (SEQ ID N0:3) is not predicted to encode a DISC2 polypeptide or protein. Rather, 
DISC2 is thought to be a non-protein coding gene for a structural RNA. 

The present invention relates, in particular, to novel variants of the DISCI and 
DIS2 nucleic acids, including novel mutants, allelic variants and polymorphisms of full 
5 length DISCI and DISC2 nucleic acids. Such variants include DISCI and DISC2 nucleic 
acids which have one or more nucleic acid substitutions, insertions or deletions. In a 
preferred embodiment, the DISCI and DISC2 nucleic acids of the invention are ones that 
have a single nucleotide polymorphism (SNP). Such SNPs include single base insertions, 
deletions and substitutions. In particularly preferred embodiments, the variant DISCI and 

10 DISC2 nucleic acids of the invention are ones having a mutations or polymorphism which 
correlates with a neuropsychiatric disorder such as schizophrenia, schizoaffective disorder, 
bipolar disorder, unipolar affective disorder and adolescent conduct disorder. For example, 
preferred polymorphisms of the DISCI and DISC2 nucleic acids of the invention are 
described in the Examples, infra, including ones that correlate to neuropsychiatric disorders 

1 5 such as schizophrenia, schizoaffective disorder, bipolar disorder, unipolar affective disorder 
and adolescent conduct disorder. 

The present invention also relates to fragments of full length DISCI and 
DISC2 nucleic acid sequences. For example, in preferred embodiments the DISCI and 
DISC2 nucleic acid fragments of the invention comprise nucleotide sequences that 

20 correspond to at least 10 nucleotides, preferably at least 15 nucleotides, more preferably at 
least 20 nucleotides and still more preferably at least 25 nucleotides of a full length DISCI 
or DISC2 nucleic acid sequence (e.g., of the DISCI and DISC2 nucleic acid sequences set 
forth in FIGS. 1A-C and in FIGS. 3A-G, respectively, and in SEQ ID NOS:l and 3). In 
other preferred embodiments, the DISCI and DISC2 nucleic acid fragments of the invention 

25 comprise sequences of at least 1 0, preferably at least 1 5, more preferably at least 20 and still 
more preferably at least 25 nucleotides that hybridize, under conditions defined in detail 
below (e.g. , for oligonucleotide molecules), to a full length DISCI or DISC2 sequence (for 
example, to one of the DISCI or DISC2 sequences set forth in FIGS. 1A-C and in FIGS. 
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3A-G, respectively, and in SEQ ID N0S:1 and 3) or to the complement of a full length 
DISCI or DISC2 sequence. 

Such DISCI and DISC2 nucleic acid fragments may be used in the methods 
of the invention, e.g., to diagnose and/or treat a neuropsychiatric disorder such as 
5 schizophrenia, schizoaffective disorder, bipolar disorder, unipolar affective disorder and 
adolescent conduct disorder. Accordingly, in a preferred embodiment DISCI and DISC2 
nucleic acid fragments of the invention comprise nucleotide sequences of at least 1 0, at least 
15, at least 20 or at least 25 nucleotides which comprise at least one polymorphic sequence 
of a DISCI or DISC2 nucleic acid, such as one of the DISCI or DISC2 polymorphic 

10 sequences described in the Examples, infra. In another preferred embodiment, DISCI and 
DISC2 nucleic acid fragments of the invention may comprise nucleotide sequences of at least 
10, at least 15, at least 20 or at least 25 nucleotides that specifically hybridize, under 
conditions defined in detail below (e.g. , for oligonucleotides) to a DISC 1 or DISC2 sequence 
(or its complement) that contains at least one polymorphism, such as one of the DISCI or 

15 DISC2 polymorphisms described in the Examples, infra. 

5. 1 Definitions 

The terms used in this specification generally have their ordinary meanings 
in the art, within the context of this invention and in the specific context where each term is 
20 used. Certain terms are discussed below, or elsewhere in the specification, to provide 
additional guidance to the practitioner in describing the devices and methods of the invention 
and how to make and use them. 

General definitions 

25 The term "neuropsychiatric disorder", which may also be referred to as a 

"major mental illness disorder" or "major mental illness", refers to a disorder which may be 
generally characterized by one or more breakdowns in the adaptation process. Such 
disorders are therefore expressed primarily in abnormalities of thought, feeling and/or 
behavior producing either distress or impairment of function (i.e., impairment of mental 
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function such with dementia or senility). Currently, individuals may be evaluated for various 
neuropsychiatric disorders using criteria set forth in the most recent version of the American 
Psychiatric Association's Diagnostic and Statistical Manual of Mental Health (DSM-IV). 

Exemplary neuropsychiatric disorders include, but are not limited to, 
5 schizophrenia, attention deficit disorder (ADD), schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder, and adolescent conduct disorder. 

As used herein, the term "isolated 1 ' means that the referenced material is 
removed from the environment in which it is normally found. Thus, an isolated biological 
material can be free of cellular components, / e. , components of the cells in which the 

1 0 material is found or produced. In the case of nucleic acid molecules, an isolated nucleic acid 
includes a PGR product, an isolated mRNA, a cDNA, or a restriction fragment. In another 
embodiment, an isolated nucleic acid is preferably excised from the chromosome in which 
it may be found, and more preferably is no longer joined to non-regulatory, non-coding 
regions, or to other genes, located upstream or downstream of the gene contained by the 

1 5 isolated nucleic acid molecule when found in the chromosome. In yet another embodiment, 
the isolated nucleic acid lacks one or more introns. Isolated nucleic acid molecules include 
sequences inserted into plasmids, cosmids, artificial chromosomes, and the like. Thus, in a 
specific embodiment, a recombinant nucleic acid is an isolated nucleic acid. An isolated 
protein may be associated with other proteins or nucleic acids, or both, with which it 

20 associates in the cell, or with cellular membranes if it is a membrane-associated protein. An 
isolated organelle, cell, or tissue is removed from the anatomical site in which it is found in 
an organism. An isolated material may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been isolated 
under conditions that reduce or eliminate the presence of unrelated materials, i.e., 

25 contaminants, including native materials from which the material is obtained. For example, 
a purified protein is preferably substantially free of other proteins or nucleic acids with which 
it is associated in a cell; a purified nucleic acid molecule is preferably substantially free of 
proteins or other unrelated nucleic acid molecules with which it can be found within a cell. 
As used herein, the term "substantially free" is used operationally, in the context of analytical 
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testing of the material Preferably, purified material substantially free of contaminants is at 
least 50% pure; more preferably, at least 90% pure, and more preferably still at least 99% 
pure. Purity can be evaluated by chromatography, gel electrophoresis, immunoassay, 
composition analysis, biological assay, and other methods known in the art. 
5 Methods for purification are well-known in the art. For example, nucleic 

acids can be purified by precipitation, chromatography (including preparative solid phase 
chromatography, oligonucleotide hybridization, and triple helix chromatography), 
ultracentrifugation, and other means. Polypeptides and proteins can be purified by various 
methods including, without limitation, preparative disc-gel electrophoresis, isoelectric 

10 focusing, HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition 
chromatography, precipitation and salting-out chromatography, extraction, and 
countercurrent distribution. For some purposes, it is preferable to produce the polypeptide 
in a recombinant system in which the protein contains an additional sequence tag that 
facilitates purification, such as, but not limited to, a polyhistidine sequence, or a sequence 

1 5 that specifically binds to an antibody, such as FLAG and GST. The polypeptide can then be 
purified from a crude ly sate of the host cell by chromatography on an appropriate solid-phase 
matrix. Alternatively, antibodies produced against the protein or against peptides derived 
therefrom can be used as purification reagents. Cells can be purified by various techniques, 
including centrifugation, matrix separation {e.g. , nylon wool separation), panning and other 

20 immunoselection techniques, depletion (e. g . complement depletion of contaminating cells), 
and cell sorting (e.g., fluorescence activated cell sorting [FACS]). Other purification 
methods are possible. A purified material may contain less than about 50%, preferably less 
than about 75%, and most preferably less than about 90%, of the cellular components with 
which it was originally associated. The "substantially pure" indicates the highest degree of 

25 purity which can be achieved using conventional purification techniques known in the art. 

A "sample" as used herein refers to a biological material which can be tested 
for the presence of DISCI or DISC2 polypeptides, or for the presence of DISCI or DISC2 
nucleic acids, e.g., to evaluate a gene therapy or expression in a transgenic animal or to 
identify cells that express DISC 1 or DISC2. Such samples can be obtained from any source, 
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including tissue, blood and blood cells, including circulating hematopoietic stem cells (for 
possible detection of protein or nucleic acids), plural effusions, cerebrospinal fluid (CSF), 
ascites fluid, and cell culture. In a preferred embodiment, samples are obtained from brain 
tissue or from other tissues of the nervous system. 
5 Non-human animals include, without limitation, laboratory animals such as 

mice, rats, rabbits, hamsters, guinea pigs, etc.; domestic animals such as dogs and cats; and, 
farm animals such as sheep, goats, pigs, horses, and cows, and especially such animals made 
transgenic with human DISCI or DISC2. 

In preferred embodiments, the terms "about" and "approximately" shall 

1 0 generally mean an acceptable degree of error for the quantity measured given the nature or 
precision of the measurements. Typical, exemplary degrees of error are within 20 percent 
(%), preferably within 10%, and more preferably within 5% of a given value or range of 
values. Alternatively, and particularly in biological systems, the terms "about" and 
"approximately" may mean values that are within an order of magnitude, preferably within 

1 5 5-fold and more preferably within 2-fold of a given value. Numerical quantities given herein 
are approximate unless stated otherwise, meaning that the term "about" or "approximately" 
can be inferred when not expressly stated. 

The term "aberrant" or "abnormal", as applied herein refers to an activity or 
feature which differs from (a) a normal or activity or feature, or (b) an activity or feature 

20 which is within normal variations of a standard value. 

For example, an "abnormal" activity of a gene or protein such as the DISCI 
or DISC2 gene or protein refers to an activity which differs from the activity of the wild-type 
or native gene or protein, or which differs from the activity of the gene or protein in a healthy 
subject, e.g., a subject not afflicted with a disease associated with a specific allelic variant 

25 of a DISCI or DISC2 polymorphism. An activity of a gene includes, for instance, the 
transcriptional activity of the gene which may result from, e.g. , an aberrant promoter activity. 
Such an abnormal transcriptional activity can result, e.g., from one or more mutations in a 
promoter region, such as in a regulatory element thereof. An abnormal transcriptional 
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activity can also result from a mutation in a transcription factor involved in the control of 
gene expression. 

An activity of a protein can be aberrant because it is stronger than the activity 
of its native counterpart. Alternatively, an activity can be aberrant because it is weaker or 
5 absent related to the activity of its native counterpart. An aberrant activity can also be a 
change in an activity. For example an aberrant protein can interact with a different protein 
relative to its native counterpart. A cell can have an aberrant activity due to overexpression 
or underexpression of the gene encoding DISCI or DISC2. An aberrant DISCI or DISC2 
activity can result, e.g. , from a mutation in the gene, which results, e.g., in lower or higher 
10 binding affinity of a ligand or substrate to the protein encoded by the mutated gene. 

The term "molecule" means any distinct or distinguishable structural unit of 
matter comprising one or more atoms, and includes, for example, polypeptides and 
polynucleotides. 

15 Molecular Biology Definitions 

In accordance with the present invention, there may be employed 
conventional molecular biology, microbiology and recombinant DNA techniques within the 
skill of the art. Such techniques are explained fully in the literature. See, for example, 
Sambrook, Fitsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition 

20 (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (referred to 
herein as "Sambrook et al, 1989"); DNA Cloning: A Practical Approach, Volumes I and 
II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid 
Hybridization (B.D. Hames & S.J. Higgins, eds. 1984); Animal Cell Culture (R.I. Freshney, 
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B.E. Perbal, A Practical 

25 Guide to Molecular Cloning (1984); F.M. Ausubel et al (eds.), Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994). 

The term "polymer" means any substance or compound that is composed of 
two or more building blocks ('mers') that are repetitively linked together. For example, a 
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"dimer" is a compound in which two building blocks have been joined togther; a "trimer" 
is a compound in which three building blocks have been joined together; etc. 

The term "polynucleotide" or "nucleic acid molecule" as used herein refers 
to a polymeric molecule having a backbone that supports bases capable of hydrogen bonding 
5 to typical polynucleotides, wherein the polymer backbone presents the bases in a manner to 
permit such hydrogen bonding in a specific fashion between the polymeric molecule and a 
typical polynucleotide {e.g., single-stranded DNA). Such bases are typically inosine, 
adenosine, guanosine, cytosine, uracil and thymidine. Polymeric molecules include "double 
stranded" and "single stranded" DNA and RNA, as well as backbone modifications thereof 

1 0 (for example, methylphosphonate linkages). 

Thus, a "polynucleotide" or "nucleic acid" sequence is a series of nucleotide 
bases (also called "nucleotides"), generally in DNA and RNA, and means any chain of two 
or more nucleotides. A nucleotide sequence frequently carries genetic information, including 
the information used by cellular machinery to make proteins and enzymes. The terms 

15 include genomic DNA, cDNA, RNA, any synthetic and genetically manipulated 
polynucleotide, and both sense and antisense polynucleotides. This includes single- and 
double-stranded molecules; i.e., DNA-DNA, DNA-RNA, and RNA-RNA hybrids as well 
as "protein nucleic acids" (PNA) formed by conjugating bases to an amino acid backbone. 
This also includes nucleic acids containing modified bases, for example, thio-uracil, thio- 

20 guanine and fluoro-uracil. 

The polynucleotides herein may be flanked by natural regulatory sequences, 
or may be associated with heterologous sequences, including promoters, enhancers, response 
elements, signal sequences, polyadenylation sequences, introns, 5'- and 3'-non-coding 
regions and the like. The nucleic acids may also be modified by many means known in the 

25 art. Non-limiting examples of such modifications include methylation, "caps", substitution 
of one or more of the naturally occurring nucleotides with an analog, and internucleotide 
modifications such as, for example, those with uncharged linkages {e.g., methyl 
phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged 
linkages {e.g., phosphorothioates, phosphorodithioates, etc.). Polynucleotides may contain 
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one or more additional covalently linked moieties, such as proteins (e.g., nucleases, toxins, 
antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), 
chelators (e.g. , metals, radioactive metals, iron, oxidative metals, etc. ) and alkylators to name 
a few. The polynucleotides may be derivatized by formation of a methyl or ethyl 
5 phosphotriester or an alkyl phosphoramidite linkage. Furthermore, the polynucleotides 
herein may also be modified with a label capable of providing a detectable signal, either 
directly or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin 
and the like. Other non-limiting examples of modification which may be made are provided, 
below, in the description of the present invention. 

10 A "polypeptide" is a chain of chemical building blocks called amino acids that 

are linked together by chemical bonds called "peptide bonds". The term "protein" refers to 
polypeptides that contain the amino acid residues encoded by a gene or by a nucleic acid 
molecule (e.g. , an mRNA or a cDNA) transcribed from that gene either directly or indirectly. 
Optionally, a protein may lack certain amino acid residues that are encoded by a gene or by 

1 5 an mRNA. For example, a gene or mRNA molecule may encode a sequence of amino acid 
residues on the N-terminus of a protein (i.e., a signal sequence) that is cleaved from, and 
therefore may not be part of, the final protein, A protein or polypeptide, including an 
enzyme, may be a "native" or "wild-type", meaning that it occurs in nature; or it may be a 
"mutant", "variant" or "modified", meaning that it has been made, altered, derived, or is in 

20 some way different or changed from a native protein or from another mutant. 

A "ligand" is, broadly speaking, any molecule that binds to another molecule. 
In preferred embodiments, the ligand is either a soluble molecule or the smaller of the two 
molecules or both. The other molecule is referred to as a "receptor". In preferred 
embodiments, both a ligand and its receptor are molecules (preferably proteins or 

25 polypeptides) produced by cells. In particularly preferred embodiments, a ligand is a soluble 
molecule and the receptor is an integral membrane protein (i.e., a protein expressed on the 
surface of a cell). However, the distinction between which molecule is the ligand and which 
is the receptor may be an arbitrary one. 
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The binding of a ligand to its receptor is frequently a step in signal 
transduction within a cell. Exemplary ligand-receptor interactions include, but are not 
limited to, binding of a hormone to a hormone receptor (for example, the binding of estrogen 
to the estrogen receptor) and the binding of a neurotransmitter to a receptor on the surface 
5 of a neuron. 

"Amplification" of a polynucleotide, as used herein, denotes the use of 
polymerase chain reaction (PCR) to increase the concentration of a particular DNA sequence 
within a mixture of DNA sequences. For a description of PCR see Saiki et al , Science 1 988, 
239:487, 

1 0 "Chemical sequencing" of DNA denotes methods such as that of Maxam and 

Gilbert (Maxam-Gilbert sequencing; see Maxam & Gilbert, Proc. Natl Acad Set U.S.A. 

1977, 74:560), in which DNA is cleaved using individual base-specific reactions. 

"Enzymatic sequencing" of DNA denotes methods such as that of Sanger 

(Sanger et al, Proc. Natl Acad. Sci. U.S.A. 1911, 74:5463) and variations thereof well 
15 known in the art, in a single-stranded DNA is copied and randomly terminated using DNA 

polymerase. 

A "gene" is a sequence of nucleotides which code for a functional "gene 
product". Generally, a gene product is a functional protein. However, a gene product can 
also be another type of molecule in a cell, such as an RNA {e.g. , a tRNA or a rRNA). For 

20 example, the gene DISC2, described supra, is not known to encode any particular protein 
and, rather, is believed encode a structural RNA molecule instead (see Millar et al , Human 
Molecular Genetics 2000, 9: 141 5-1423). For the purposes of the present invention, a gene 
also refers to an mRNA sequence which may be found in a cell. For example, measuring 
gene expression levels according to the invention may correspond to measuring mRNA 

25 levels. A gene may also comprise regulatory (i. e. , non-coding) sequences as well as coding 
sequences. Exemplary regulatory sequences include promoter sequences, which determine, 
for example, the conditions under which the gene is expressed. The transcribed region of the 
gene may also include untranslated regions including introns, a 5'-untranslated region (5 T - 
UTR) and a 3'-untranslated region (3'-UTR). 
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A "coding sequence" or a sequence "encoding" and expression product, such 
as a RNA, polypeptide, protein or enzyme, is a nucleotide sequence that, when expressed, 
results in the production of that RNA, polypeptide, protein or enzyme; Le., the nucleotide 
sequence "encodes" that RNA or it encodes the amino acid sequence for that polypeptide, 
5 protein or enzyme. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiation transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bounded 
at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to 
1 0 include the minimum number of bases or elements necessary to initiate transcription at levels 
detectable above background. Within the promoter sequence will be found a transcription 
initiation site (conveniently found, for example, by mapping with nuclease SI), as well as 
protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. 

15 A coding sequence is "under the control of or is "operative ly associated 

with" transcriptional and translational control sequences in a cell when RNA polymerase 
transcribes the coding sequence into RNA, which is then trans-RNA spliced (if it contains 
introns) and, if the sequence encodes a protein, is translated into that protein. 

The term "express" and "expression" means allowing or causing the 

20 information in a gene or DNA sequence to become manifest, for example producing RNA 
(such as rRNA or mRNA) or a protein by activating the cellular functions involved in 
transcription and translation of a corresponding gene or DNA sequence. A DNA sequence 
is expressed by a cell to form an "expression product" such as an RNA {e.g. , a mRNA or a 
rRNA) or a protein. The expression product itself, e.g., the resulting RNA or protein, may 

25 also said to be "expressed" by the cell. 

The term "transfection" means the introduction of a foreign nucleic acid into 
a cell. The term "transformation" means the introduction of a "foreign" {i.e., extrinsic or 
extracellular) gene, DNA or RNA sequence into a host cell so that the host cell will express 
the introduced gene or sequence to produce a desired substance, in this invention typically 
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an RNA coded by the introduced gene or sequence, but also a protein or an enzyme coded 
by the introduced gene or sequence. The introduced gene or sequence may also be called a 
"cloned" or "foreign" gene or sequence, may include regulatory or control sequences (e.g., 
start, stop, promoter, signal, secretion or other sequences used by a cell's genetic machinery). 
5 The gene or sequence may include nonfunctional sequences or sequences with no known 
function. A host cell that receives and expresses introduced DNA or RNA has been 
"transformed" and is a "transformant" or a "clone". The DNA or RNA introduced to a host 
cell can come from any source, including cells of the same genus or species as the host cell 
or cells of a different genus or species. 

10 The terms "vector", "cloning vector" and "expression vector" mean the 

vehicle by which a DNA or RNA sequence (e.g., a foreign gene) can be introduced into a 
host cell so as to transform the host and promote expression (e.g., transcription and 
translation) of the introduced sequence. Vectors may include plasmids, phages, viruses, etc 
and are discussed in greater detail below. 

15 A "cassette" refers to a DNA coding sequence or segment of DNA that codes 

for an expression product that can be inserted into a vector at defined restriction sites. The 
cassette restriction sites are designed to ensure insertion of the cassette in the proper reading 
frame. Generally, foreign DNA is inserted at one or more restriction sites of the vector 
DNA, and then is carried by the vector into a host cell along with the transmissible vector 

20 DNA. A segment or sequence of DNA having inserted or added DNA, such as an expression 
vector, can also be called a "DNA construct." A common type of vector is a "plasmid", 
which generally is a self-contained molecule of double-stranded DNA, usually of bacterial 
origin, that can readily accept additional (foreign) DNA and which can readily introduced 
into a suitable host cell. A large number of vectors, including plasmid and fungal vectors, 

25 have been described for replication and/or expression in a variety of eukaryotic and 
prokaryotic hosts. 

The term "host cell" means any cell of any organism that is selected, modified, 
transformed, grown or used or manipulated in any way for the production of a substance by 
the cell. For example, a host cell may be one that is manipulated to express a particular 
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gene, a DNA or RNA sequence, a protein or an enzyme. Host cells can further be used for 
screening or other assays that are described infra. Host cells may be cultured in vitro or one 
or more cells in a non-human animal {e.g., a transgenic animal or a transiently transfected 
animal). 

5 The term "expression system" means a host cell and compatible vector under 

suitable conditions, e.g. for the expression of a protein coded for by foreign DNA carried by 
the vector and introduced to the host cell. Common expression systems include E. coli host 
cells and plasmid vectors, insect host cells such as Sf9, Hi5 or S2 cells and Baculovirus 
vectors, Drosophila cells (Schneider cells) and expression systems, and mammalian host 

10 cells and vectors. For example, DISCI and/or DISC2 may be expressed in PC12, COS-1, 
or C 2 C 12 cells. Other suitable cells include CHO cells, HeLa cells, 293T (human kidney 
cells), mouse primary myoblasts, and NIH 3T3 cells. 

The term "heterologous" refers to a combination of elements not naturally 
occurring. For example, the present invention includes chimeric RNA molecules that 

15 comprise an rRNA sequence and a heterologous RNA sequence which is not part of the 
rRNA sequence. In this context, the heterologous RNA sequence refers to an RNA sequence 
that is not naturally located within the ribosomal RNA sequence. Alternatively, the 
heterologous RNA sequence may be naturally located within the ribosomal RNA sequence, 
but is found at a location in the rRNA sequence where it does not naturally occur. As 

20 another example, heterologous DNA refers to DNA that is not naturally located in the cell, 
or in a chromosomal site of the cell. Preferably, heterologous DNA includes a gene foreign 
to the cell. A heterologous expression regulatory element is a regulatory element operatively 
associated with a different gene that the one it is operatively associated with in nature. 

An "allele" refers to any one of a series of two or more genes that occupy the 

25 same position or locus on a chromosome. Generally, alleles refer to different forms of a gene 
that differ by at least one nucleic acid residue. Thus, as used here, the terms "allele" and 
"allelic variant" refer, not only to different forms of genomic sequences, but may also refer 
to different forms of sequences that are encoded by or otherwise derived from allelic variants 
of the genomic sequence. For example, the term allelic variant may refer to mRNA 
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sequences that are encoded by allelic variants of a genomic sequence, or to cDNA sequences 
that are derived from such variant mRNA sequences. As it is used herein, the term allelic 
variant can also refer to protein or polypeptides sequences which are derived from (e.g., 
encoded by) allelic variants of a particular gene. 
5 Allelic variants are usually described by comparing their nucleotide or (in the 

case of variant polypeptides) amino acid sequences to a common "wild-type" or "reference" 
sequence. Thus, a "wild-type" or "reference" allele of a gene refers to that allele of a gene 
having a genomic sequence designated as the wild-type sequence and/or encoding a 
polypeptide having an amino acid sequence that is also designated as a wild-type sequence. 

10 The wild-type allele may be arbitrarily selected from any of the different alleles that may 
exist for a particular gene. However, the allele is most typically selected to be the allele 
which is most prevalent in a population of individuals. Thus, for example, the wild-type 
DISCI and 2 genomic sequences have been arbitrarily selected, here, as the genomic 
sequence deposited in GenBank (Accession No. AF222980, GI No. 8163868; and 

15 AF222981, GI No. 8163870; DISC 1 and 2, respectively) and set forth here in SEQ ID 
NOS:l and 3. 

The term "polymorphism" refers, generally, to the coexistence of more than 
one form of a gene (e.g., more than one allele) within a population of individuals. The 
different alleles may differ at one or more positions of their nucleic acid sequences, which 
20 are referred to herein as "polymorphic locuses". When used herein to describe polypeptides 
that are encoded by different alleles of a gene, the term "polymorphic locus" also refers to 
the positions in an amino acid sequence that differ among variant polypeptides encoded by 
different alleles. 

The polymorphisms of the present invention include "single nucleotide 
25 polymorphisms" (SNPs) and microsatellite repeats. The term SNP refers to a polymorphic 
site occupied by a single nucleotide, which is the site of variation between allelic sequences. 
Typically, the polymorphic site of an SNP is flanked by highly conserved sequences (e.g., 
sequences that vary in lees than 1/100 and, more preferably, in less than 1/1000 individuals 
in a population). The polymorphic locus of an SNP may be a single base deletion, a single 
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base insertion, or a single base substitution. Single base substitutions are particularly 
preferred. 

A "microsatellite repeat" or "microsatellite", as the term is used herein, refers 
to a short sequence of repeating nucleotides within a nucleic acid. Typically, a microsatellite 
5 repeat comprises a repeating sequence of two (i.e., a dinucleotide repeat), three (i.e., a 
trinucleotide repeat), four (/. e. , a tetranucleotide repeat) or five (/. e. , a pentanucleotide repeat) 
nucleotides. Microsatellites of the invention therefore have the general formula (N 1? N 2 , . . 
. N,)„, wherein N represents a nucleic acid residue (e.g., adenine, thymine, cytosine or 
guanine), i represents the number of the last nucleotide in the microsatellite, and n represents 

1 0 the number of times the motif is repeated in the microsatellite locus. In one embodiment the 
number of nucleotides in a microsatellite motif (z) is about six, preferably between two and 
five, and more preferably two, three or four. The total number of repeats (n) in a 
microsatellite repeat may be, e.g , from one to about 60, preferably from 4 to 40, and more 
preferably from 1 0 to 30 when / = 2; is preferably between about 4-25, and more preferably 

1 5 between about 6-22 when i = 3 ; and is preferably between about 4-15, and more preferably 
between about 5-10 when / = 4. A DISCI and/or DISC2 nucleic acid of the invention may 
comprise any microsatellite repeat of the above general formula. However, the following 
motifs are particularly preferred: CA, TC. and, AATTG; as well as all complements and 
permutations of such motifs (for example, TG, GA, and CAATT. 

20 The term "locus" refers to a specific position on a chromosome. For example, 

the locus of a DISCI and/or DISC2 gene refers to the chromosomal position of that gene. 

The term "linkage" refers to the tendency of genes, alleles, loci or genetic 
markers to be inherited together as a result of their location on the same chromosome. 
Linkage may be measured, e.g., by the percent recombination between two genes, alleles, 

25 loci or genetic markers. 

The terms "mutant" and "mutation" mean any detectable change in genetic 
material, e.g., DNA, or any process, mechanism or result of such a change. This includes 
gene mutations, in which the structure (e.g., DNA sequence) of a gene is altered, any gene 
or DNA arising from any mutation process, and any expression product (e.g. , RNA, protein 
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or enzyme) expressed by a modified gene or DNA sequence. The term "variant" may also 
be used to indicate a modified or altered gene, DNA sequence, RNA, enzyme, cell, etc. ; I e. , 
any kind of mutant. 

"Sequence-conservative variants" of a polynucleotide sequence are those in 
5 which a change of one or more nucleotides in a given codon position results in no alteration 
in the amino acid encoded at that position. 

"Function-conservative variants" of a polypeptide or polynucleotide are those 
in which a given amino acid residue in the polypeptide, or the amino acid residue encoded 
by a codon of the polynucleotide, has been changed or altered without altering the overall 

1 0 conformation and function of the polypeptide. For example, function-conservative variants 
may include, but are not limited to, replacement of an amino acid with one having similar 
properties (for example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, 
aromatic and the like). Amino acid residues with similar properties are well known in the 
art. For example, the amino acid residues arginine, histidine and lysine are hydrophilic, basic 

1 5 amino acid residues and may therefore be interchangeable. Similar, the amino acid residue 
isoleucine, which is a hydrophobic amino acid residue, may be replaced with leucine, 
methionine or valine. Such changes are expected to have little or no effect on the apparent 
molecular weight or isoelectric point of the polypeptide. Amino acid residues other than 
those indicated as conserved may also differ in a protein or enzyme so that the percent 

20 protein or amino acid sequence similarity (e.g. , percent identity or homology) between any 
two proteins of similar function may vary and may be, for example, from 70% to 99% as 
determined according to an alignment scheme such as the Cluster Method, wherein similarity 
is based on the MEGALIGN algorithm. "Function-conservative variants" of a given 
polypeptide also include polypeptides that have at least 60% amino acid sequence identity 

25 to the given polypeptide as determined, e.g., by the BLAST or FASTA algorithms. 
Preferably, function-conservative variants of a given polypeptide have at least 75%, more 
preferably at least 85% and still more preferably at least 90% amino acid sequence identity 
to the given polypeptide and, preferably, also have the same or substantially similar 



-23- 



properties (e.g., of molecular weight and/or isoelectric point) or functions (e.g., biological 
functions or activities) as the native or parent polypeptide to which it is compared. 

The term "homologous", in all its grammatical forms and spelling variations, 
refers to the relationship between two proteins that possess a "common evolutionary origin", 
5 including proteins from superfamilies (e.g., the immunoglobulin superfamily) in the same 
species of organism, as well as homologous proteins from different species of organism (for 
example, myosin light chain polypeptide, etc.; see, Reeck et al 9 Cell 1987, 50:667). Such 
proteins (and their encoding nucleic acids) have sequence homology, as reflected by their 
sequence similarity, whether in terms of percent identity or by the presence of specific 

1 0 residues or motifs and conserved positions. 

The term "sequence similarity in ail its grammatical forms, refers to the 
degree of identity or correspondence between nucleic acid or amino acid sequences that may 
or may not share a common evolutionary origina (see, Reeck et al, supra). However, in 
common usage and in the instant application, the term ''homologous", when modified with 

15 an adverb such as "highly", may refer to sequence similarity and may or may not relate to 
a common evolutionary origin. 

In specific embodiments, two nucleic acid sequences are "substantially 
homologous" or "substantially similar" when at least about 80%, and more preferably at least 
about 90% or at least about 95% of the nucleotides match over a defined length of the nucleic 

20 acid sequences, as determined by a sequence comparison algorithm known such as BLAST, 
FASTA, DNA Strider, CLUSTAL, etc. An example of such a sequence is an allelic or 
species variant of the specific genes of the present invention. Sequences that are 
substantially homologous may also be identified by hybridization, e.g., in a Southern 
hybridization experiment under, e.g., stringent conditions as defined for that particular 

25 system. 

Similarly, in particular embodiments of the invention, two amino acid 
sequences are "substantially homologous" or "substantially similar" when greater than 80% 
of the amino acid residues are identical, or when greater than about 90% of the amino acid 
residues are similar (i.e., are functionally identical). Preferably the similar or homologous 
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polypeptide sequences are identified by alignment using, for example, the GCG (Genetics 
Computer Group, Program Manual for the GCG Package, Version 7, Madison Wisconsin) 
pileup program, or using any of the programs and algorithms described above (e.g. , BLAST, 
FASTA, CLUSTAL, etc.). 
5 As used herein, the term "oligonucleotide" refers to a nucleic acid, generally 

of at least 1 0, preferably at least 1 5, and more preferably at least 20 nucleotides, preferably 
no more than 100 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA 
molecule, or an mRNA molecule encoding a gene, mRNA, cDNA, or other nucleic acid of 
interest. Oligonucleotides can be labeled, e.g. , with 32 P-nucleotides or nucleotides to which 

10 a label, such as biotin or a fluorescent dye (for example, Cy3 or Cy5) has been covalently 
conjugated. In one embodiment, a labeled oligonucleotide can be used as a probe to detect 
the presence of a nucleic acid. In another embodiment, oligonucleotides (one or both of 
which may be labeled) can be used as PCR primers, either for cloning full length or a 
fragment of DISCI or DISC2, or to detect the presence of nucleic acids encoding DISCI or 

1 5 DISC2. In a further embodiment, an oligonucleotide of the invention can form a triple helix 
with a DISCI DNA molecule or with a DISC2 DNA molecule. Generally, oligonucleotides 
are prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, 
oligonucleotides can be prepared with non-naturally occurring phosphoester analog bonds, 
such as thioester bonds, etc. 

20 The present invention provides antisense nucleic acids (including ribozymes), 

which may be used to inhibit expression of a DISCI or a DISC2 gene or their respective 
gene products. An "antisense nucleic acid" is a single stranded nucleic acid molecule which, 
on hybridizing under cytoplasmic conditions with complementary bases in an RN A or DNA 
molecule, inhibits the latter' s role. If the RNA is a messenger RNA transcript, the antisense 

25 nucleic acid is a countertranscript or mRNA-interfering complementary nucleic acid. As 
presently used, "antisense" broadly includes RNA-RNA interactions, RNA-DNA 
interactions, triple helix interactions, ribozymes and RNase-H mediated arrest. Antisense 
nucleic acid molecules can be encoded by a recombinant gene for expression in a cell (e.g. , 
U.S. Patent No. 5,814,500; U.S. Patent No. 5,81 1,234), or alternatively they can be prepared 



synthetically (e.g. , U.S. Patent No. 5,780,607). Other specific examples of antisense nucleic 
acid molecules of the invention are provided infra. 

Specific non-limiting examples of synthetic oligonucleotides envisioned for 
this invention include, in addition to the nucleic acid moieties described above, 
5 oligonucleotides that contain phosphorothioates, phosphotriesters, methyl phosphonates, 
short chain alkyl, or cycloalkyl intersugar linkages or short chain heteroatomic or 
heterocyclic intersugar linkages. Most preferred are those with CH 2 -NH-0-CH 2 , CH 2 - 
N(CH 3 )-0-CH 2 , CH 2 -0-N(CH 3 )-CH 2 , CH 2 -N(CH 3 )-N(CH 3 )-CH 2 and 0-N(CH 3 )-CH 2 -CH 2 
backbones (where phosphodiester is 0-P0 2 -0-CH 2 ). US Patent No, 5,677,437 describes 

1 0 heteroaromatic olignucleoside linkages. Nitrogen linkers or groups containing nitrogen can 
also be used to prepare oligonucleotide mimics (U.S Patents Nos. 5,792,844 and 5,783,682). 
US Patent No. 5,637,684 describes phosphoramidate and phosphorothioamidate oligomeric 
compounds. Also envisioned are oligonucleotides having morpholino backbone structures 
(U.S. Pat. No. 5,034,506). In other embodiments, such as the peptide-nucleic acid (PNA) 

15 backbone, the phosphodiester backbone of the oligonucleotide may be replaced with a 
polyamide backbone, the bases being bound directly or indirectly to the aza nitrogen atoms 
of die polyamide backbone (Nielsen ei al, Science 254:1497, 1991). Other synthetic 
oligonucleotides may contain substituted sugar moieties comprising one of the following at 
the 2' position: OH, SH, SCH 3 , F, OCN, 0(CH 2 ) n NH 2 or 0(CH 2 ) n CH 3 where n is from 1 to 

20 about 1 0; C x to C 10 lower alkyl, substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF 3 ; 
OCF 3 ; 0-; S-, or N-alkyl; 0-, S-, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; ON0 2 ;N0 2 ; N 3 ; NH 2 ; 
heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substitued silyl; 
a fluorescein moiety; an RNA cleaving group; a reporter group; an intercalator; a group for 
improving the pharmacokinetic properties of an oligonucleotide; or a group for improving 

25 the pharmacodynamic properties of an oligonucleotide, and other substituents having similar 
properties. Oligonucleotides may also have sugar mimetics such as cyclobutyls or other 
carbocyclics in place of the pentofuranosy 1 group. Nucleotide units having nucleosides other 
than adenosine, cytidine, guanosine, thymidine and uridine, such as inosine, may be used in 
an oligonucleotide molecule. 
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A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate conditions of 
temperature and solution ionic strength (see Sambrook et al 7 supra). The conditions of 
5 temperature and ionic strength determine the "stringency" of the hybridization. For 
preliminary screening for homologous nucleic acids, low stringency hybridization 
conditions, corresponding to a T m (melting temperature) of 55 °C, can be used, e.g., 5x SSC, 
0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5x SSC, 0.5% SDS). 
Moderate stringency hybridization conditions correspond to a higher T m , e.g., 40% 

1 0 formamide, with 5x or 6x SCC. High stringency hybridization conditions correspond to the 
highest T m5 e.g. , 50% formamide, 5x or 6x SCC SCC is a 0. 1 5M NaC 1 , 0.01 5M Na-citrate. 
Hybridization requires that the two nucleic acids contain complementary sequences, although 
depending on the stringency of the hybridization, mismatches between bases are possible. 
The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic 

1 5 acids and the degree of complementation, variables well known in the art. The greater the 
- degree of similarity or homology between two nucleotide sequences, the greater the value 
of T m for hybrids of nucleic acids having those sequences. The relative stability 
(corresponding to higher T m ) of nucleic acid hybridizations decreases in the following order: 
RNA:RNA, DNA: RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, 

20 equations for calculating T m have been derived (see Sambrook et al , supra, 9.50-9.51). For 
hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches 
becomes more important, and the length of the oligonucleotide determines its specificity (see 
Sambrook et al , supra, 11.7-11 .8). A minimum length for a hybridizable nucleic acid is at 
least about 1 0 nucleotides; preferably at least about 1 5 nucleotides; and more preferably the 

25 length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" refers 
to a T m of 55 °C, and utilizes conditions as set forth above. In a preferred embodiment, the 
T m is 60°C; in a more preferred embodiment, the T m is 65°C. In a specific embodiment, 
"high stringency" refers to hybridization and/or washing conditions at 68 °C in 0.2XSSC, at 

-27- 



42 °C in 50% formamide, 4XSSC, or under conditions that afford levels of hybridization 
equivalent to those observed under either of these two conditions. 

Suitable hybridization conditions for oligonucleotides (e.g., for 
oligonucleotide probes or primers) are typically somewhat different than for full-length 
5 nucleic acids (e.g., full-length cDNA) ? because of the oligonucleotides* lower melting 
temperature. Because the melting temperature of oligonucleotides will depend on the length 
of the oligonucleotide sequences involved, suitable hybridization temperatures will vary 
depending upon the oligoncucleotide molecules used. Exemplary temperatures may be 37 °C 
(for 14-base oligonucleotides), 48 °C (for 17-base oligonucleotides), 55 °C (for 20-base 
10 oligonucleotides) and 60 °C (for 23 -base oligonucleotides). Exemplary suitable 
hybridization conditions for oligonucleotides include washing in 6x SSC/0.05% sodium 
pyrophosphate, or other conditions that afford equivalent levels of hybridization. 



5. 2 DISCI and DISC2 Nucleic Acids 

15 The DISCI and DISC2 nucleic acid molecules of the invention are defined 

above, and include DNA and RNA molecules as well as nucleic acid molecules comprising 
any of the modifications (e.g., modified bases and/or backbone) described supra. 

DISCI 

20 With respect, first, to DISCI nucleic acids, a DISCI nucleic acid molecule 

comprises a nucleic acid sequence that encodes a DISCI polypeptide (described in detail, 
infra), the complement of a nucleic acid sequence that encodes a DISCI polypeptide, and 
fragments thereof Thus, in one preferred embodiment a DISCI nucleic acid molecules of 
the invention comprises nucleotide sequences that encode the amino acid sequence set forth 

25 in FIG, 2 (SEQ ID NO:2), such as the nucleotide sequence set forth in FIGS. 1A-C (SEQ 
ID NO:l) and, in particular, the nucleotide sequence of positions 54-2616 of the sequence 
set forth in FIGS. 1A-C(SEQIDN0:1). In another preferred embodiment, a DISCI nucleic 
acid molecule of the invention comprises a nucleic acid sequence that encodes a splice 
variant of the DISCI polypeptide set forth in FIG. 2 and in SEQ ID NO:2. For example, a 
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DISCI nucleic acid molecule of the invention may encode a DISCI polypeptide that is 
essentially identical to the DISCI polypeptide sequence set forth in FIG. 2 (SEQ ID NO:2) ? 
except that the amino acid residues corresponding to positions 748-768 of SEQ ID NO:2 are 
omitted. In a preferred aspect of this embodiment, the DISCI nucleic acid molecule 
5 comprises a coding sequence that is essentially identical to the coding sequence set forth in 
FIGS. 1 A-C (SEQ ID NO: 1 ) except that the sequence corresponding to nucleotide positions 
2295-2360 is omitted. 

Alternatively, a DISCI nucleic acid molecule of the invention may encode 
one or more exons of a DISCI gene product. For example, in one embodiment a DISCI 

1 0 nucleic acid molecule of the invention may encode the sequence of amino acid residues 768- 
768 of the sequence set forth in FIG. 2 (SEQ ID NO:2). In a preferred aspect of the 
embodiment, the DISCI nucleic acid molecule comprises the sequence of nucleotide 
positions 2295-2360 of the sequence set forth in FIGS. 1A-C (SEQ ID NO:l). 

In other embodiments, DISCI nucleic acid molecules of the invention 

1 5 comprise a nucleotide sequence that encodes one or more domains of a DISCI polypeptide. 
For example, a DISCI nucleic acid molecule of the invention may encode an N-terminal 
domain of a DISCI polypeptide (e.g., amino acid residues 1-247 of SEQ ID NO:2), a C- 
terminal domain of a DISCI polypeptide (e.g., amino acid residues 348-854 of SEQ ID 
NO:2), one or more a-helix domains of a DISCI polypeptide (e.g. , amino acid residues 367- 

20 394, 452-500 or 602-630 of SEQ ID NO:2), a coiled-coil domain of a DISCI polypeptide 
(e.g. , two or more a-helix domain of a DISCI polypeptide and, optionally, the intervening 
amino acid residues; for example, in a preferred embodiment the coiled-coil domain 
comprises amino acid residues 367-630 of SEQ ID NO:2), or a globular domain of a DISC 1 
polypeptide. 

25 In another, preferred embodiment, a DISCI nucleic acid molecule of the 

invention is one which comprises a nucleotide sequence selected from the sequences set forth 
in TABLE 3 shown in the Examples, infra, or a portion thereof. For example, a DISCI 
nucleic acid molecule of the invention may also be a nucleic acid molecule comprising the 
sequence for one or more introns indicated in the DISCI genomic sequences set forth in 
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TABLE 3; for one or more exons, delineated in TABLE 4, infra, of the DISCI genomic 
sequences in TABLE 3; for the 5'-promoter region comprised in one or more of the genomic 
sequence set forth in TABLE 3, or a combination of one or more of the individual exons, 
introns and/or the 5 r -promoter region from the DISCI genomic sequence set forth in 
5 TABLE 3. 

In other embodiments, DISCI nucleic acid molecules of the invention 
comprise a nucleotide sequence which encodes one or more domains of a DISCI 
polypeptide. For example, a DISCI nucleic acid molecule of the invention may encode an 
N-terminal domain of a DISCI polypeptide (e.g., amino acid residues 1-347 of SEQ ID 
1 0 NO:2), a C-terminal domain of a DISCI polypeptide (e.g., amino acid residues 348-854 of 
SEQ ID NO:2), one or more a-helix domains of a DISCI polypeptide (e.g., amino acid 
residues 367-394, 452-500 or 602-630 of SEQ ID NO:2) or a globular domain of a DISCI 
polypeptide. 

The DISCI nucleic acid molecules ofthe present invention may also comprise 
15 a nucleic acid sequence that encodes one or more fragments (e.g., an epitope) of a DISCI 
polypeptide. 

The DISCI nucleic acid molecules of the present invention also include 
nucleic acid molecules that comprise modified or variant DISCI nucleic acid sequences, 
including DISCI nucleic acid molecules that have one or more nucleic acid insertions, 

20 substitutions, deletions or truncations. Such modified and variant DISCI nucleic acid 
molecules includes ones that comprise coding sequences for modified DISCI polypeptides 
(e.g., having amino acid substitutions, deletions or truncations) and for variants (including 
analogs and homologs from the same and different species of organism) of DISCI 
polypeptides. In preferred embodiments, such nucleic acid molecules have at least 50%, 

25 preferably at least 75% and more preferably at least 90% sequence identity to a DISCI 
coding nucleotide sequence such as the coding sequence comprising nucleotide positions 54 
through 2616 ofthe sequence set forth in FIGS. 1A-C (SEQ ID NO:l). 

Preferably, the modified or variant DISCI nucleic acid molecules of the 
invention comprise sequence modifications or variations that correlate with a 
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neuropsychiatric disorder, such as schizophrenia, schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder and adolescent conduct disorder. Particularly preferred 
variant DISC 1 nucleic acids include those described in the Examples, infra. 

As used here, the term "homologous 11 , in all its grammatical forms and 
5 spelling variations, refers to the relationship between nucleic acids, and the proteins or gene 
products they encode, that are understood to possess a "common evolutionary origina", 
including genes and proteins from superfamilies (for example, the immunoglobulin 
superfamily) and homologous proteins and nucleic acids (e.g. , genes) from different species. 
See, for example, Reeck et al, Cell 1987, 50:667. Corresponding nucleic acid molecules 

1 0 (e.g. , genes) and proteins from different species are referred to as "orthologs." Homologous 
and orthologous proteins, and their encoding nucleic acids, have sequence homology which 
is reflected by their sequence similarity. Such sequence similarity may be indicated, for 
example, by the percent of sequence similarity (e.g. , a percentage of nucleotide or amino acid 
sequence identity or homology), or by the presence of certain nucleotides or certain amino 

1 5 acid residues and conserved positions, or by the presence of specific "sequence motifs" (I e. , 
specific sequences of nucleotides or amino acid residues) at conserved positions. 

The term "sequence similarity", in all its grammatical forms, refers to the 
degree of identity or correspondence between nucleotide or amino acid sequences. Except 
as otherwise noted herein, the term "homologous" refers merely to sequence similarity and 

20 does not necessarily relate to a common evolutionary origin 

In specific embodiments, two polypeptide sequences are "substantially 
homologous" or "substantially similar" when the polypeptides are at least 35-40% similar 
as determined by one of the algorithms disclosed herein. Preferably, two polypeptides are 
substantially homologous when the polypeptides are at least about 60%, and more preferably 

25 at least about 90 or 95% similar in one or more highly conserved domains or, for alleles, 
across the entire amino acid sequence. Two nucleic acid sequences are "substantially 
homologous" or "substantially similar" when the polypeptides they encode are substantially 
homologous, as defined above. Alternatively, two nucleic acid sequences may be 
"substantially homologous" or "substantially similar" if their nucleotide sequences are, 
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themselves, at least about 35-40% similar, preferably at least about 60% similar and more 
preferably at least about 90 to 95% similar, as determined by one of the algorithms disclosed 
herein. 

Sequence comparison algorithms that may be used to compare amino acid or 
5 nucleic acid sequences include the BLAST algorithms (e.g., BLASTP, BLASTN and 
BLASTX), FASTA, DNA Strider, the GCG (Genetics Computer Group, Program Manual 
for the GCG Package, Version 7, Madison, Wisconsin) pileup program, etc. Unless 
otherwise stated, all sequence comparisons referred to herein are done using the default 
parameters provided with these algorithms. 

10 In one embodiment, the homologous DISCI nucleic acids of the invention 

(including, e.g., DISCI analogs or variants) encode functionally similar molecules (ie., 
molecules that perform one or more DISCI functions or have one or more DISCI 
bioactivities). Thus, in a specific embodiment, an analog or variant of a DISCI polypeptide 
is a function-conservative variant. "Function-conservative variants" of a polypeptide are 

1 5 those in which a given amino acid residue of the polypeptide has been changed without 
altering the overall conformation and/or function (e.g. , bioactivity) of the polypeptide. Such 
changes include, but are not limited to, replacement of an amino acid with one having similar 
properties; such as similar properties of polarity, hydrogen bonding potential, acidity, 
alkalinity, hydrophobicity, aromaticity and the like. For example and not by way of 

20 limitation, arginine, histidine and lysine or hydrophilic-basic amino acids and may be 
interchangeable. Similarly, isoleucine, a hydrophobic amino acid, may be replaced with 
leucine, methionine or valine. Such changes are expected to have little or no effect on the 
apparent molecular weight or isoelectric point of the protein or polypeptide. 

In another embodiment, the homologous DISCI nucleic acids of the invention 

25 (including, e.g. , DISCI analogs or variants) are sequence conservative variants. "Sequence- 
conservative variants" of a nucleic acid are ones that have a different polynucleotide 
sequence but encode the same amino acid sequence. 

In preferred embodiments, the homologous DISCI nucleic acids of the 
invention (including, e.g., DISCI analogs or variants) are ones which are associated with a 
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neuropsychiatry disorder, such as schizophrenia, schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder and adolescent conduct disorder. 

Examples of homologous DISCI nucleic acids include allelic or species 
variants of the DISCI nucleic acid sequence set forth in FIGS. 1A-C (SEQ ID NO:l). 
5 Preferred examples of such homologous sequences include the particular variants set forth 
in the Examples, infra. 

Alternatively, DISCI nucleic acid molecules of the invention may also be 
ones that hybridize to the complement of another DISCI nucleic acid molecules (e.g., in a 
Southern blot assay) under any of the hybridization conditions defined in this apecification. 

10 In a particular embodiment, a DISCI nucleic acid molecule of the invention comprises a 
nucleotide sequence which hybridizes to the complement of a DISCI nucJeic acid sequence, 
such as the sequence set forth in FIGS. 1A-C (SEQ ID NO:l) or a portion thereof, under 
highly stringent hybridization conditions that comprise 50% formamide and 5X or 6X SSC. 
In other embodiments, the nucleic acid molecule of the invention hybridizes to a complement 

15 of a DISCI nucleic acid sequence (e.g., the sequence set forth in FIGS. 1A-C and in SEQ 
ID NO:l, or a portion thereof) under moderately stringent hybridization conditions (e.g., 
40% formamide with 5X or 6X SSC), or under low stringency hybridization conditions (e.g. , 
in 5X SSC, 0.1% SDS, 0.25% mile, no formamide, 30% formamid, 5X SSC or 0.5% SDS). 
Particularly preferred hybridization conditions comprise hybridization at 42 °C in a low 

20 stringency hybridization buffer (e.g. , 30% formamide, 1 0 mM Tris pH 7.6, 2.5X Denhardt's 
solution, 5X SSC, 0.5% SDS and 1.5 mg/ml sonicated salmon sperm DNA) followed by 
washing (preferably twice) at 50 °C using a low stringency washing buffer (e.g. , 0.5X SSC 
and 1%SDS). 

In other embodiments, the DISCI nucleic acid molecules of the invention 
25 comprise fragments of a full length DISCI sequence. For example, in preferred 
embodiments such DISCI nucleic acid fragments comprise a nucleotide sequence that 
corresponds to a sequence of at least 10 nucleotides, preferably at least 1 5 nucleotides, more 
preferably at least 20 nucleotides and still more preferably at least 25 nucleotides of the full 
length DISCI nucleotide sequence. In specific embodiments, the fragments correspond to 
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a portion {e.g., of at least 10, 15, 20 or 25 nucleotides) of the DISCI sequence set forth in 
FIGS. 1A-C (SEQ ID NO:l) or of another nucleotide sequence encoding the polypeptide 
sequence set forth in FIG. 2 (SEQ ID NO:2). In another embodiment, the fragments 
correspond to a portion (e.g., of at least 10, 15, 20, or 25 nucleotides) of a variant DISCI 
5 sequence, such as a DISCI sequence having a polymorphism. For example, in preferred 
embodiments such fragments may comprise one or more of the polymorphisms described 
in the Examples, infra. 

In other preferred embodiments, the DISCI nucleic acid fragments of the 
invention comprise sequence of at least 10, preferably at least 15, more preferably at least 

10 20 and still more preferably at least 25 nucleotides that are complementary to and/or 
hybridize to a full length DISCI nucleic acid sequence (e.g. , the sequence set forth in FIGS. 
1A-C and in SEQ ID NO: 1) or to a portion thereof. In preferred embodiments, the nucleic 
acid fragments are complementary to a variant DISCI nucleic acid sequence (for example, 
any of the variant sequences described in the Examples, infra) and/or specifically hybridize 

15 to such a variant DISCI nucleic acid sequence. More preferably, the nucleic acid fragments 
of the invention are fragments which specifically hybridize to a variant DISCI nucleic acid 
sequence and do not hybiridze to a regular or wild-type DISCI sequence under the same, 
defined conditions. 

Suitable hybridization conditions for such oligonucleotides are described 

20 supra, and include washing in 6X SSC/0.05% sodium pyrophosphate. Because the melting 
temperature of oligonucleotides will generally depend on the length of the oligonucleotide 
sequence, suitable hybridization temperatures may be expected to vary depending upon the 
oligonucleotide molecule used. Exemplary temperatures include 37 °C (for 14-base 
oligonucleotides), 48 °C (for 1 7-base oligonucleotides), 5 5 °C (for 20-base oligonucleotides) 

25 and 60 °C (for 23 or 25-base oligoncueltodies). 

The DISCI nucleic acid molecules of the invention also include "chimeric" 
DISCI nucleic acids. Such chimeric nucleic acid molecules are polynucleotides which 
comprise at least one DISCI nucleic acid sequence, and also at least one non-DISCl nucleic 
acid sequence. The DISCI nucleic acid sequence contained in a chimeric DISCI nucleic 
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acid molecule may be any of the wild-type, variant or polymorphic full length DISCI 
sequences described here above, or may be any portion thereof (for example, a portion 
encoding a particular domain or containing a particular polymorphism). The non-DISCl 
nucleic acid sequence may be, for example, a regulatory sequence (e.g., a promoter 
5 sequence) that is derived from another, non-DISC 1 gene and is not normally associated with 
a naturally occurring DISCI gene. The non-DISCl nucleic acid sequence may also be a 
coding sequence for another, non-DISCl polypeptide such as a FLAG, a histidine tag, 
glutathione S-transferase (GST), hemaglutinin, (3-galactosidase, Green Fluorescent Protein 
(GFP), thioreductase or an immunoglobulin protein. In preferred embodiments, a chimeric 
1 0 nucleic acid molecule of the invention encodes a DISC 1 fusion polypeptide of the invention 
(described below). 

DISC2 

The DISC2 gene, as described supra, is a gene on human chromosome 1 
15 which overlaps with, but is transcribed in the opposite direction of the human DISC I gene 
described above. Unlike DISCI, DISC2 nucleic acids are not believed to encode 
polypeptides but, rather, encode a structural RN A gene product. 

In preferred embodiments, a DISC2 nucleic acid molecule of the invention 
comprises the nucleotide sequence set forth in FIGS, 3A-G (SEQ ID NO^), or a portion or 
20 fragment thereof. In another preferred embodiment, a DISC2 nucleic acid molecule of the 
invention may be a splice variant of the DISC2 sequence set forth in FIGS. 3A-G (SEQ ID 
NO:3). For example, DISC2 transcripts of at least 9.5 kb, and of approximately 6, 3 and 
2.5 kb have been detected in heart tissue (see, Millar et al. , Human Molecular Genetics 2000, 
9:1415-1425). Such transcripts are among the DISC2 nucleic acid molecules of the present 
25 invention and, unless otherwise stated, may be used in any of the compositions and methods 
of the invention which recite use of a DISC2 nucleic acid. 

The DISC2 nucleic acids of the present invention also include nucleic acid 
molecules that comprise modified or variant DISC2 nucleic acid sequences, including DISC2 
sequences that have on or more nucleic acid insertions, substitutions, deletions or 
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truncations. Such variant DISC2 nucleic acids may be, for example, analogs or homologs 
of a DISC2 nucleic acid in the same or in different species of organism. In preferred 
embodiments, such sequences have at least 50%, preferably at least 75% and more preferably 
at least 90% sequence identity to a DISC2 nucleotide sequence, such as the one set forth in 
5 FIGS. 3A-G (SEQ ID NO:3). The meanings of the terms "homologous" and "sequence 
identity" are as defined, supra, for DISCI nucleic acid sequences of the invention. 

In preferred embodiments, the modified or variant DISC2 nucleotide 
sequences are ones which are associated with a neuropsychiatric disorder such as 
schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar disorder and 

10 adolescent conduct disorder. Variant DISCI nucleotides are described in the Examples, 
infra. Although specific DISC2 polymorphisms are not explicitly identified herein, the 
similarities between DISCI and DISC2 enables a skilled artisan to practice the invention 
with respect to DISC2 based on the specific information provided for DISCI. 

Alternatively, DISC2 nucleic acid molecules of the invention may also be 

15 ones that hybridize to the complement of another DISC2 nucleic acid molecules (e.g., in a 
Southern blot assay) under any of the hybridization conditions defined in this apecification. 
In a particular embodiment, a DISC2 nucleic acid molecule of the invention comprises a 
nucleotide sequence which hybridizes to the complement of a DISC2 nucleic acid sequence, 
such as the sequence set forth in FIGS. 3A-G (SEQ ID NO:3) or a portion thereof, under 

20 highly stringent hybridization conditions that comprise 50% formamide and 5X or 6X SSC. 
In other embodiments, the nucleic acid molecule of the invention hybridizes to a complement 
of a DISC2 nucleic acid sequence (e.g., the sequence set forth in FIGS. 3A-G and in SEQ 
ID NO:3, or a portion thereof) under moderately stringent hybridization conditions (e.g., 
40% formamide with 5X or 6X SSC), or under low stringency hybridization conditions (e.g. , 

25 in 5X SSC, 0.1% SDS, 0.25% mile, no formamide, 30% formamid, 5X SSC or 0.5% SDS). 
Particularly preferred hybridization conditions comprise hybridization at 42 °C in a low 
stringency hybridization buffer (e.g., 30% formamide, 10 mM Tris pH 7.6, 2.5X Denhardfs 
solution, 5X SSC, 0.5% SDS and 1.5 mg/ml sonicated salmon sperm DNA) followed by 
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washing (preferably twice) at 50 °C using a low stringency washing buffer (e.g., 0.5X SSC 
andl%SDS). 

In other embodiments, the DISC2 nucleic acid molecules of the invention 
comprise fragments of a full length DISC2 sequence. For example, in preferred 
5 embodiments such OSCAR nucleic acid fragments comprise a nucleotide sequence that 
corresponds to a sequence of at least 1 0 nucleotides, preferably at least 1 5 nucleotides, more 
preferably at least 20 nucleotides and still more preferably at least 25 nucleotides of the full 
length DISC2 nucleotide sequence, In specific embodiments, the fragments correspond to 
a portion (e.g., of at least 10, 15, 20 or 25 nucleotides) of the DISC2 sequence set forth in 

1 0 FIGS. 3 A-G (SEQ ID NO:3). In another embodiment, the fragments correspond to a portion 
(e.g. , of at least 1 0, 1 5, 20, or 25 nucleotides) of a variant DISC2 sequence, such as a DISC2 
sequence having a polymorphism. Variant DISC 1 nucleotide fragments are described in the 
Examples, infra. Again, although specific DISC2 fragments containing polymorphisms are 
not explicitly identified herein, the similarities between DISC 1 and DISC2 enables a skilled 

15 artisan to practice the invention with respect to DISC2 based on the specific information 
provided for DISCI. 

In other preferred embodiments, the DISC2 nucleic acid fragments of the 
invention comprise sequence of at least 10, preferably at least 1 5, more preferably at least 
20 and still more preferably at least 25 nucleotides that are complementary to and/or 

20 hybridize to a full length DISC2 nucleic acid sequence (e.g. , the sequence set forth in FIGS. 
3A-G and in SEQ ID NO:3) or to a portion thereof In preferred embodiments, the nucleic 
acid fragments are complementary to a variant DISC2 nucleic acid sequence (for example, 
any of the variant sequences described in the Examples, infra) and/or specifically hybridize 
to such a variant DISC2 nucleic acid sequence. More preferably, the nucleic acid fragments 

25 of the invention are fragments which specifically hybridize to a variant DISC2 nucleic acid 
sequence and do not hybridize to a regular or wild-type DISC2 sequence under the same, 
defined conditions. 

Suitable hybridization conditions for such oligonucleotides are described 
supra, and include washing in 6X SSC/0.05% sodium pyrophosphate. Because the melting 
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temperature of oligonucleotides will generally depend on the length of the oligonucleotide 
sequence, suitable hybridization temperatures may be expected to vary depending upon the 
oligonucleotide molecule used. Exemplary temperatures include 37 °C (for 14-base 
oligonucleotides), 48 °C (for 17-base oligonucleotides), 55 °C (for 20-base oligonucleotides) 
5 and 60 °C (for 23 or 25-base oligoncueltodies). 

The DISC2 nucleic acid molecules of the invention also include "chimeric" 
DISC2 nucleic acids. Such chimeric nucleic acid molecules are polynucleotides which 
comprise at least one DISC2 nucleic acid sequence, and also at least one non-DISC2 nucleic 
acid sequence. The DISC2 nucleic acid sequence contained in a chimeric DISC2 nucleic 

10 acid molecule may be any of the wild-type, variant or polymorphic full length DISC2 
sequences described here above, or may be any portion thereof (for example, a portion 
encoding a particular domain or containing a particular polymorphism). The non-DISC2 
nucleic acid sequence may be, for example, a regulatory sequence (eg., a promoter 
sequence) that is derived from another, non-DISC2 gene and is not normally associated with 

15 a naturally occurring D1SC2 gene. The non-DISC2 nucleic acid sequence may also be a 
coding sequence for another, non-DISC2 polypeptide such as a FLAG, a histidine tag, 
glutathione S-transferase (GST), hemaglutinin, P-galactosidase, Green Fluorescent Protein 
(GFP), thioreductase or an immunoglobulin protein. 

20 Nucleic acid molecules comprising the above described DISCI and DISC2 

fragments are useful, for example, as oligonucleotide probes and primers {e.g. , PCR primers) 
to detect and amplify other nucleic acid molecules encoding a DISC 1 polypeptide or DISC2 
RNA, including genes that encode variant DISCI polypeptides or DISC2 RNA such as 
analogs, homologs and variants. Oligonucleotide fragments of the invention may also be 

25 used, e.g. , as antisense nucleic acids, triple helix forming oligonucleotides or as ribozymes; 
e.g., to modulate levels of DISCI or DISC2 gene expression or transcription in cells. 

For example, FIG. 4 describes several specific nucleic acids, comprising the 
nucleotide sequences set forth in SEQ ID NOS:44-127, that may be used to amplify regions 
of a DISC 1 or DISC2 gene or genomic sequence as described in the Examples. In particular, 
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these sequences are used in the Examples to amplify particular segments of the DISCI and 
DISC2 genomic sequence set forth in SEQ ID NOS:l and 3, and identify nucleic acid 
mutations or polymorphisms (including single nucleotide polymorphisms) which correlate 
with and are therefore associated with a neuropsychiatric disorder. The nucleic acids of the 
5 present invention therefore include ones which comprise any of the nucleotide sequences set 
forth in FIG. 4, and in SEQ ID NOS:44-127. 

The "primers" and "probes" of the invention are nucleic acid sequence which 
can be used for amplifying and/or identifying a DISCI or DISC2 gene sequence. Primers 
can be used alone in a detection method, or a primer can be used together with at least one 

1 0 other primer or probe in a detection method. Primers can also be used to amplify at least a 
portion of a nucleic acid. Probes of the invention refer to nucleic acids which hybridize to 
the region of interest and which are not further extended. For example, a probe is a nucleic 
acid which specifically hybridizes to a polymorphic region of a DISCI or DISC2 gene, and 
which by hybridization or absence of hybridization to the DN A of a subject will be indicative 

1 5 of the identity of the allelic variant of the polymorphic region of the DISC 1 or DISC2 gene. 
Preferred probes include nucleic acid sequences comprising all or pari of any of the nucleic 
. acid sequences set forth in TABLE 5B (SEQ ID NOS: 33-43). 

Numerous procedures for determining the nucleotide sequence of a nucleic 
acid molecule, or for determining the presence of mutations in nucleic acid molecules 

20 include a nucleic acid amplification step, which can be carried out by, e.g., the polymerase 
chain reaction (PCR). Accordingly, in one embodiment, the invention provides primers for 
amplifying portions of a DISCI or DISC2 gene, such as portions of exons and/or portions 
of introns. In a preferred embodiment, the exons and/or sequences adjacent to the exons of 
the human DISCI or DISC2 gene will be amplified to, e.g., detect which allelic variant of 

25 a polymorphic region is present in the DISC 1 or DISC2 gene of a subject. Preferred primers 
comprise a nucleotide sequence complementary a specific allelic variant of a DISCI or 
DISC2 polymorphic region and of sufficient length to selectively hybridize with a DISCI 
or DISC2 gene. In a preferred embodiment, the primer, e.g., a substantially purified 
oligonucleotide, comprises a region having a nucleotide sequence which hybridizes under 
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stringent conditions to about 6, 8, 10, or 12, preferably 25, 30, 40, 50, or 75 consecutive 
nucleotides of a DISC 1 or DISC2 gene. In an even more preferred embodiment, the primer 
is capable of hybridizing to a DISCI or DISC2 nucleotide sequence and has a nucleotide 
sequence of any sequence set forth in any of SEQ ID NOS:33-127, complements thereof, 
5 allelic variants thereof, or complements of allelic variants thereof. For example, primers 
comprising a nucleotide sequence of at least about 1 5 consecutive nucleotides, at least about 
25 nucleotides or having from about 15 to about 20 nucleotides set forth in any of SEQ ID 
NOS:33-127, or complement thereof are provided by the invention. Primers having a 
sequence of more than about 25 nucleotides are also within the scope of the invention. 

10 Preferred primers of the invention are primers that can be used in PCR for amplifying each 
of the exons of a DISCI or DISC2 gene. 

Primers can be complementary to nucleotide sequences located close to each 
other or further apart, depending on the use of the amplified DN A. For example, primers can 
be chosen such that they amplify DNA fragments of at least about 1 0 nucleotides or as much 

15 as several kilobases. Preferably, the primers of the invention will hybridize selectively to 
nucleotide sequences located about 150 to about 350 nucleotides apart. 

For amplifying at least a portion of a nucleic acid, a forward primer (i.e., 5' 
primer) and a reverse primer (i e. t 3' primer) will preferably be used. Forward and reverse 
primers hybridize to complementary strands of a double stranded nucleic acid, such that upon 

20 extension from each primer, a double stranded nucleic acid is amplified. Forward and 
reverse primers are shown in FIG. 4 (SEQ ID NOS:44 -127). 

DISCI or DISC2 nucleic acid molecules of the invention, whether genomic 
DNA, cDNA, mRNA or otherwise, can be isolated from any source including, for example, 
cDNA or genomic libraries. Preferably, the cDNA library is a library generated from cells, 

25 tissue or organ, such as brain, which expresses a DISCI or DISC2 gene of the invention. 
Methods for obtaining particular genes (L e. , DISC 1 or DISC2 genes and nucleic acids) from 
such libraries are well known in the art, as described above (see, e.g. , Sambrook et al , 1 989, 
supra). 
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The DNA may be obtained by standard procedures known in the art from 
cloned DNA (for example, from a DNA "library "), and preferably is obtained from a cDNA 
library prepared from cells or tissue with high level expression of the gene or its gene 
product (for example, from brain cells or tissue). In one embodiment, the DNA may be 
5 obtained from a "subtraction" library to enrich the library for cDNAs of genes specifically 
expressed by a particular cell type or under certain conditions. In still other embodiments, 
a library may be prepared by chemical synthesis, by cDNA cloning, or by the cloning of 
genomic DNA or fragments thereof purified from the desired cell (see, for example, 
Sambrook et al, 1989, supra; Glover, D.M. edL 1985. DNA Cloning: A Practical 

10 Approach, MRL Press, Ltd., Oxford, U.K. Vols. I and II). 

Clones derived from genomic DNA may contain regulatory and intron DNA 
regions in addition to coding regions. Clones derived from cDNA generally will not contain 
intron sequences. Whatever the source, the gene is preferably molecularly cloned into a 
suitable vector for propagation of the gene. Identification of the specific DNA fragment 

15 containing the desired DISCI or DISC2 gene may be accomplished in a number of ways. 
For example, a portion of a DISCI or DISC2 gene exemplified infra can be purified and 
labeled to prepare a labeled probe (Benton & Davis, Science 1977, 196: 180; Grunstein & 
Hogness, Proa Natl Acad. Set USA. 1975, 72:3961). Those DNA fragments with 
substantial homology to the probe, such as an allelic variant from another individual, will 

20 hybridize thereto. In a specific embodiment, highest stringency hybridization conditions are 
used to identify a homologous DISC J or DISC2 gene. 

Further selection can be carried out on the basis of properties of the DISCI 
or DISC2 gene product; such as if the gene encodes a protein product having the isoelectric 
electrophoretic, amino acid composition, partial or complete amino acid sequence, antibody 

25 binding activity or ligand binding profile of a DISC 1 polypeptide as disclosed herein. Thus, 
the presence of the gene may be detected by assays based on the physical, chemical, 
immunological or functional properties of its expressed product. 

Other DNA sequences which encode substantially the same amino acid 
sequence as a DISCI gene may be used in the practice of the present invention. These 
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include, but are not limited to allelic variants, species variants, sequence conservative 
variants, and functional variants. In particular, the nucleic acid sequences of the invention 
include both "function-conservative variants" and "sequence-conservative variants". Nucleic 
acid substitutions may be made, for example, to alter the amino acid residue encoded by a 
5 particular codon, and thereby substitute an amino acid sequence in a DISC 1 polypeptide for 
one with a particularly preferred property. 

Polymorphisms in DISCI and DISC2 

The present invention also provides, in preferred embodiments, variant DISC 1 

10 and DISC2 nucleic acids including variants which comprise one or more single nucleotide 
polymorphisms (SNPs). As an example, and not by way of limitation, TABLE 5, infra, 
discloses several single nucleotide polymorphisms (SNPs) of the DISC 1 genomic sequences 
set forth in TABLE 3. TABLE 6A discloses similar SNPs of the DISCI cDNA sequence 
set forth in SEQ ID NO: 1 . In addition, the Examples, infra, demonstrate that these SNPs are 

15 ones which correlate with neuropsychiatric disorders. Accordingly, DISCI and DISC2 
nucleic acid molecules which comprise one or more of these SNPs are particularly preferred 
embodiments of DISCI and DISC2 nucleic acids of the present invention. 

The polymorphic sequences of the invention can ad\antageously be used as 
primers to amplify an allelic variant of a DISC 1 and/or D1SC2 gene, I e. , nucleic acids which 

20 are capable of selectively hybridizing to an allelic variant of a polymorphic region of a 
DISCI and/or DISC2 gene. Thus, such primers can be specific for a DISCI and/or DISC2 
gene sequence, so long as they have a nucleotide sequence which is capable of hybridizing 
to a DISCI and/or DISC2 gene. Preferred primers are capable of specifically hybridizing to 
any of the allelic variants listed in TABLE 5 . Such primers can be used, e.g., in sequence 

25 specific oligonucleotide priming as described further herein. 

The DISCI and DISC2 nucleic acids of the invention can also be used as 
probes, e.g., in therapeutic and diagnostic assays. For instance, the present invention 
provides a probe comprising a substantially purified oligonucleotide, which oligonucleotide 
comprises a region having a nucleotide sequence that is capable of hybridizing specifically 
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to a region of a DISCI and DISC2 gene which is polymorphic (see TABLE 5, infra). In an 
even more preferred embodiment of the invention, the probes are capable of hybridizing 
specifically to an allelic variant of a DISCI and/or DISC2 gene, and comprise all or part of 
the sequences set forth in TABLE 5B (SEQ ID NOS: 33-43). Such probes can then be used 
5 to specifically detect which allelic variant of a polymorphic region of a DISC 1 and/or DISC2 
gene is present in a subject. The polymorphic region can be located in the promoter, exon, 
or intron sequences of a DISCI or DISC2 gene. 

For example, preferred probes of the invention are those probes listed in 
TABLE SB, wherein the bold nucleotides represent the location of the nucleotide 

10 polymorphism. For each probe listed in T ABLE 5B, the complement of that probe is also 
included in the table as a preferred probe of the invention. Particularly preferred probes of 
the invention have a number of nucleotides sufficient to allow specific hybridization to the 
target nucleotide sequence. Where the target nucleotide sequence is present in a large 
fragment of DNA, such as a genomic DNA fragment of several tens or hundreds of 

15 kilobases, the size of the probe may have to be longer to provide sufficiently specific 
hybridization, as compared to a probe which is used to detect a target sequence which is 
present in a shorter fragment of DNA. For example, in some diagnostic methods, a portion 
of a DISCI and/or DISC2 gene may first be amplified and thus isolated from the rest of the 
chromosomal DNA and then hybridized to a probe. In such a situation, a shorter probe will 

20 likely provide sufficient specificity of hybridization. For example, a probe having a 
nucleotide sequence of about 10 nucleotides may be sufficient. 

In preferred embodiments, the probe or primer further comprises a label 
attached thereto, which, e.g., is capable of being detected, e.g. the label group is selected 
from amongst radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

25 In another preferred embodiment of the invention, the isolated nucleic acid, 

which is used, e.g., as a probe or a primer, is modified, such as to become more stable. 
Exemplary nucleic acid molecules which are modified include phosphoramidate, 
phosphothioate and methylphosphonate analogs of DNA (see also U.S. Patents 5,176,996; 
5,264,564; and 5,256,775). 
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The nucleic acids of the invention can also be modified at the base moiety, 
sugar moiety, or phosphate backbone, for example, to improve stability of the molecule. The 
nucleic acids, e.g., probes or primers, may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell 
5 membrane (see, e.g., Letsinger et al, 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitreef a/., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. WO88/09810, 
published December 15, 1988), hybridization-triggered cleavage agents. (See, e.g., Krol et 
aL, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. 
Res. 5:539-549). To this end, the nucleic acid of the invention may be conjugated to another 

10 molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

The isolated nucleic acid comprising a DISCI and DISC2 intronic sequence 
may comprise at least one modified base moiety which is selected from the group including 
but not limited to 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 

1 5 xantine, 4-acetylcytidine, 5 -(carboxy hydroxy methyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2, 2-dimethyl guanine, 
2-methyladenine, 2-methylguanine, 3-methylcytidine, 5-methylcytidine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethy]-2-thiouracil, beta-D- * 

20 mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytidine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouraeii, 5-methyluracil, uracil-5- 
oxyacetic acidmethylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

25 The isolated nucleic acid may also comprise at least one modified sugar 

moiety selected from the group including but not limited to arabinose, 2-fluoroarabinose, 
xylulose, and hexose. 

In yet another embodiment, the nucleic acid comprises at least one modified 
phosphate backbone selected from the group consisting of a phosphorothioate, a 
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phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet a further embodiment, the nucleic acid is an a-anomeric 
oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual p-units, the strands run parallel to each 
other (Gautier et al 9 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al. 9 1987, Nucl Acids Res, 15:6131-6148), or a chimeric 
RNA-DNA analogue (Inoue et al, 1987, FEES Lett. 215:327-330). 

Any nucleic acid fragment of the invention can be prepared according to 
methods well known, in the art and described, e.g., in Sambrook, J. Fritsch, E.F., and 
Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. For example, discrete fragments of the DNA 
can be prepared and cloned using restriction enzymes. Alternatively, discrete fragments can 
be prepared using the Polymerase Chain Reaction (PCR) using primers having an appropriate 
sequence. 

Oligonucleotides of the invention may be synthesized by standard methods 
known in the art, e.g. by use of an automated DNA synthesizer (such as are commercially 
available from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate 
oligonucleotides may be synthesized by the method of Stein et al (1988, Nucl. Acids Res. 
16:3209), methylphosphonate oligonucleotides can be prepared by use of controlled pore 
glass polymer supports (Smnetal , 1988, Proc.Natl. Acad. Sci. U.S.A. 85:7448-7451), or 
other means know in the art. 

The invention also provides other variants of DISC 1 and DISC2 nucleic acids, 
including nucleic acids having variant microsatellite repeats. A "microsatellite repeat" or 
"microsatellite", as the term is used herein, refers to a short sequence of repeating nucleotides 
within a nucleic acid. Typically, a microsatellite repeat comprises a repeating sequence of 
two (/. e. , a dinucleotide repeat), three (/. e. , a trinucleotide repeat), four (L e. , a tetranucleotide 
repeat) or five (i.e., a pentanucleotide repeat) nucleotides. Microsatellites of the invention 
therefore have the general formula (N„ N 2 , . . . N,)„, wherein N represents a nucleic acid 
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residue (e.g., adenine, thymine, cytosine or guanine), / represents the number of the last 
nucleotide in the microsatellite, and n represents the number of times the motif is repeated 
in the microsatellite locus. In one embodiment the number of nucleotides in a microsatellite 
motif (/) is about six, preferably between two and five, and more preferably two, three or 
5 four. The total number of repeats (n) in a microsatellite repeat may be, e.g., from one to 
about 60, preferably from 4 to 40, and more preferably from 10 to 30 when / = 2; is 
preferably between about 4-25, and more preferably between about 6-22 when / = 3; and is 
preferably between about 4-15, and more preferably between about 5-10 when z = 4. A 
DISCI and DISC2 nucleic acid of the invention may comprise any microsatellite repeat of 

10 the above general formula. However, the following motifs are particular^ preferred: CA, 
TC, and, AATTG; as well as all complements and permutations of such motifs (for example, 
TG, GA, and CAATT. These variant DISCI and DISC2 nucleic acids are also considered 
part of the present invention. 

Accordingly, the nucleic acid molecules of the present invention include 

15 DISCI and DISC2 nucleic acid molecules having one or more of the polymorphisms 
described in TABLES 5 and6(SEQ IDNOS: 33-43). In preferred embodiments, the nucleic 
acid molecules of the invention include specific DISCI and DISC2 allelic variants, which 
differ from the reference or wild-type DISCI and DISC2 nucleic acid molecules described 
supra (i. e. , nucleic acid molecules having the nucleotide sequence set forth in SEQ ID NOS 

20 1 and 3, and in TABLE 3). 

The genes encoding DISCI and/or DISC2 derivatives and analogs of the 
invention can be produced by various methods known in the art. The manipulations which 
result in their production can occur at the gene or protein level. For example, the cloned 
DISC1/DISC2 gene sequence can be modified by any of numerous strategies known in the 

25 art (see, e.g., Sambrook et ai, 1989, supra). The sequence can be cleaved at appropriate 
sites with restriction endonuclease(s), followed by further enzymatic modification if desired, 
isolated, and ligated in vitro. In the production of the gene encoding a derivative or analog 
DISC 1 and/or DISC2, care should be taken to ensure that the modified gene remains within 
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the same translational reading frame as the DISC1/DISC2 gene, uninterrupted by 
translational stop signals, in the gene region where the desired activity is encoded. 

Additionally, the DISCI and DISC2-encoding nucleic acid sequences can be 
mutated in vitro or in vivo, to create and/or destroy translation, initiation and/or termination 
5 sequences, or to create variations in coding regions and/or form new restriction endonuclease 
sites or destroy preexisting ones, to facilitate further in vitro modification. Modifications can 
also be made to introduce restriction sites and facilitate cloning the DISC 1/DISC2 genes into 
an expression vector. An technique for mutagenesis known in the art can be used, including 
but not limited to, in vitro site-directed mutagenesis (Hutchinson et al , J. Biol Chem. 1 978, 

10 253:6551; Zoller & Smith, DNA 1984, 3:479-488; Oliphant et al, Gene 1986, 44:177; 
Hutchinson et al, Proc. Natl Acad. Set U.S.A. 1986, 83:710), use of TAB" linkers 
(Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see, Higuchi, 
1 989, "Using PCR to Engineer DNA" in PCR Technology: Principles and Applications for 
DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70). 

15 The identified and isolated gene can then be inserted into an appropriate 

cloning vector. A large number of vector-host systems known in the art may be used. 
Possible cloning vectors include, but are not limited to, plasmids or modified viruses. The 
vector system must, however, by compatible with the host cell used. Examples of vectors 
include, but are not limited to, E. coli, bacteriophages such as lambda derivatives, or 

20 plasmids such aspBR322 derivatives or pUC plasmid derivatives, e.g. , pGEX vectors, pmal- 
c, pFLAG, pKK plasmids (Clonetech), pET plasmids (Novagen, Inc., Madison, WI), pRSET 
or pREP plasmids, pcDNA (Invitrogen, Carlsbad, CA), pMAL plasmids (New England 
Biolabs, Beverly, MA), etc. The insertion into a cloning vector can, for example, be 
accomplished by ligating the DNA fragment into a cloning vector which has complementary 

25 cohesive termini. However, if the complementary restriction sites used to fragment the DNA 
are not present in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. Alternatively, any site desired may be produced by ligating nucleotide sequences 
{i.e., "linkers") onto the DNA termini. These ligated linkers may comprise specific 
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chemically synthesized oligonucleotides encoding restriction endonuclease recognition 
sequences. 

Recombinant molecules can be introduced into host cells via transformation, 
transfection, infection, electroporation, etc. , so that many copies of the gene sequence are 
5 generated. Preferably, the cloned gene is contained on a shuttle vector plasmid, which 
provides for expansion in a cloning cell (for example, E. coli) and facile purification for 
subsequent insertion into an appropriate expression cell line, if such is desired. For example, 
a shuttle vector, which is a vector that can replicate in more than one type of organism, can 
be prepared for replication in both E coli and Saccharomyces cerevisiae by linking 
10 sequences from an E. coli plasmid with sequence from the yeast 2m plasmid. 

Isolating DISCI and DISC2 Nucleic Acids 
DISC 1 and DISC2 nucleic acid molecules of the invention, whether genomic 
DNA, cDNA or otherwise, can be isolated from any source. Methods for obtaining DISCI 
15 and DISC2 gene are well known in the art, as described above (see, e.g., Sambrook et al, 
1989, supra). 

The DNA may be obtained by standard procedures known in the art from 
cloned DNA (for example, from a DNA "library"), and preferably is obtained from a cDNA 
library prepared from tissues with high level expression of the gene product. In other 

20 embodiments, a library may be prepared by chemical synthesis, by cDNA cloning, or by the 
cloning of genomic DNA or fragments thereof, purified from the desired ceil (See, for 
example, Sambrook etal, 1989, supra, Glover, D.M. ed., 1985, DNA Cloning: A Practical 
Approach, MRL Press, Ltd., Oxford, U.K. Vols. I and II). 

Clones derived from genomic DNA may contain regulatory and intron DNA 

25 region in addition to coding regions. Clones derived from cDN A generally will not contain 
intron sequences. Whatever the source, the gene should be molecularly cloned into a suitable 
vector for propagation of the gene. Identification of the specific DNA fragment containing 
the desired DISC 1 or DISC2 gene may be accomplished in a number of ways. For example, 
a portion of an DISCI or DISC2 gene exemplified infra can be purified and labeled to 
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prepare a labeled probe (Benton & Davis, Science 1911, 196:180; Grunstein & Hogness, 
Proc. Natl Acad. Set U.S.A. 1975, 72:3961). Those DNA fragments with substantial 
homology to the probe, such as an allelic variant from another individual, will hybridize. In 
a specific embodiment, highest stringency hybridization conditions are used to identify a 
5 homologous DISC 1 or DISC2 gene. 

Further selection can be carried out on the basis of the properties of the gene, 
e.g. , if the gene encodes a protein product having the isoelectric, electrophoretic, amino acid 
composition, partial or complete amino acid sequence, antibody binding activity, or ligand 
binding profile of DISCI or DISC2 protein as disclosed herein. Thus, the presence of the 

10 gene may be detected by assays based on the physical, chemical, immunological, or 
functional properties of its expressed product. 

Other DNA sequences which encode substantially the same amino acid 
sequence as an OSCAR gene may be used in the practice of the present invention. These 
include but are not limited to allelic variants, species variants, sequence conservative 

15 variants, and functional variants. In particular, the nucleic acid sequences of the invention 
include both "function-conservative variants" and "sequence-conservative variants". 
Function-conservative variants of a nucleic acid are those nucleic acids which encode a 
function-conservative variant of a polypeptide, as defined supra. "Sequence-conservative 
variants" of a nucleic acid are nucleic acids that have a different polynucleotide sequence but 

20 encode the same amino acid sequence. 

Amino acid substitutions may also be introduced to substitute an amino acid 
with a particularly preferable property. For example, a Cys may be introduced a potential 
site for disulfide bridges with another Cys. 

The genes encoding DISC 1 or DISC2 derivatives and analogs of the invention 

25 can be produced by various methods known in the art. The manipulations which result in 
their production can occur at the gene or protein level. For example, the cloned OSCAR 
gene sequence can be modified by any of numerous strategies known in the art (Sambrook 
et al, 1989, supra). The sequence can be cleaved at appropriate sites with restriction 
endonuclease(s), followed by further enzymatic modification if desired, isolated, and ligated 
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in vitro. In the production of the gene encoding a derivative or analog of DISC 1 or DISC2, 
care should be taken to ensure that the modified gene remains within the same translational 
reading frame as the wild-type gene, uninterrupted by translational stop signals, in the gene 
region where the desired activity is encoded. 
5 Additionally, the DISCI or DISC2 nucleic acid sequence can be mutated in 

vitro or in vivo, to create and/or destroy translation, initiation, and/or termination sequences, 
or to create variations in coding regions and/or form new restriction endonuclease sites or 
destroy preexisting ones, to facilitate further in vitro modification.. Modifications can also 
be made to introduce restriction sites and facilitate cloning the DISCI or DISC2 gene into 

1 0 an expression vector. Any technique for mutagenesis known in the art can be used, including 
but not limited to, in vitro site-directed mutagenesis (Hutchinson, C, et aL, J. Biol. Chem. 
253:6551, 1978; Zoller and Smith, DNA 3:479-488, 1984; Oliphant et al 9 Gene 44:177, 
1986; Hutchinson et aL, Proc. Natl Acad. Sci. U.S.A. 83:710, 1986), use of TAB" linkers 
(Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see Higuchi, 

15 1 989, "Using PCR to Engineer DNA" 5 in PCR Technology: Principles and Applications for 
DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate 
cloning vector. A large number of vector-host systems known in the an may be used. 
Possible vectors include, but are not limited to, plasmids or modified viruses, but the vector 

20 system must be compatible with the host cell used. Examples of vectors include, but are not 
limited to, E. coli, bacteriophages such as lambda derivatives, or plasmids such as pBR322 
derivatives or pUC plasmid derivatives, e.g. , pGEX vectors, pmal-c, pFL AG, pKK plasmids 
(Clonetech), pET plasmids (Novagen, Inc., Madison, WI), pRSET or pREP plasmids 
(Invitrogen, San Diego, CA), or pMAL plasmids (New England Biolabs, Beverly, MA), etc. 

25 The insertion into a cloning vector can, for example, be accomplished by ligating the DNA 
fragment into a cloning vector which has complementary cohesive termini. However, if the 
complementary restriction sites used to fragment the DNA are not present in the cloning 
vector, the ends of the DNA molecules may be enzymatically modified. Alternatively, any 
site desired may be produced by ligating nucleotide sequences (linkers) onto the DNA 
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termini; these ligated linkers may comprise specific chemically synthesized oligonucleotides 
encoding restriction endonuclease recognition sequences. 

Recombinant molecules can be introduced into host cells via transformation, 
transfection, infection, electroporation, etc., so that many copies of the gene sequence are 
5 generated. Preferably, the cloned gene is contained on a shuttle vector plasmid, which 
provides for expansion in a cloning cell, e.g., E. coli, and facile purification for subsequent 
insertion into an appropriate expression cell line, if such is desired. For example, a shuttle 
vector, which is a vector that can replicate in more than one type of organism, can be 
prepared for replication in both E colt and Saccharomyces cerevisiae by linking sequences 
10 from an E. coli plasmid with sequences form the yeast 2m plasmid. 

5. 3 DISCI and DISC2 Gene Products 

Gene products of the DISCI and DISC2 genes are described above. As 
explained above, the DISC2 gene is not predicted to encode a polypeptide or protein. Rather, 

J 5 DISC2 is thought to be a non-protein coding gene for a structural RNA. As such, the DISC2 
gene products of the present invention are, themselves, DISC2 nucleic acids. These DISC2 
gene products may include, therefore, all of the variations and modifications described supra 
for DISC2 nucleic acids of the invention. 

The DISCI gene is predicted to encode a polypeptide having the amino acid 

20 sequence shown in FIG. 2 and set forth in SEQ ID NO:2. This DISCI polypeptide may be 
divided into at least two regions of distinct secondary structure. The first region, which is 
referred to as the N-terminal region, comprises approximately amino acid residues 1-347 of 
SEQ ID NO:2 and is predicted to comprise at least one globular domain. The second region 
of secondary structure is also referred to as the C-terminal domain and comprises 

25 approximately amino acid residues 348-854 of SEQ ID NO:2. The C-terminal region 
comprises at least three a-helix regions which are indicated by bold underlining in FIG. 2. 
The a-helix regions correspond to approximately amino acid residues 367-394, 452-500 and 
602-630 of SEQ ID NO:2. It is understood that the a-helices contained in the C-terminal 
region may interact with each other to form coiled coil structures. 
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A splice variant of the DISC 1 gene product is also known to exist. This splice 
variant of the DISCI gene product is predicted to encode a polypeptide having the amino 
acid sequence shown in FIG. 2 (SEQ ID NO:2). However, the sequence corresponding to 
the amino acid residues indicated by light underlining in FIG. 2 (i.e., amino acid residues 
5 748-768 of SEQ ID NO:2) is omitted from this splice variant of the DISCI gene product. 

In one specific embodiment, a DISCI polypeptide is a polypeptide from a 
human cell or tissue and, more preferably, from a human brain cell or tissue. For example, 
a human DISCI polypeptide of the invention may comprise the amino acid sequence set 
forth in SEQ ID NO:2. Likewise, a DISC2 RNA is RNA from a human cell or tissue and, 

10 more preferably, from a human brain cell or tissue, which sequence corresponds to a 
sequence transcribed from a wild-type or variant DISC2 gene. 

In other embodiments, DISCI and DISC2 gene products also include 
fragments of a full length DISC 1 protein or DISC 1 RNA. For example, polypeptides which 
may be used in the compositions and methods of the present invention include polypeptides 

1 5 comprising an epitope of a full length DISC 1 polypeptide, such as epitopes of the full length 
DISCI polypeptide shown in FIG. IB (SEQ ID NO:2) or a splice variant thereof (for 
example, the splice variant described above). An epitope of a DISC 1 polypeptide represents 
a site on the polypeptide against which an antibody may be produced and to which the 
antibody binds. Thus, polypeptides comprising the amino acid sequence of a DISC 1 epitope 

20 are useful for making antibodies to a DISCI protein. Preferably, an epitope comprises a 
sequence of at least 5, more preferably at least 10, 15, 20, 25 or 50 amino acid residues in 
length. Thus, DISCI polypeptides of the invention that comprise epitopes of a DISCI 
protein preferably contain at least 5, at least 10, at least 15, at least 20, at least 25 or at least 
50 amino acid residues of a DISCI protein sequence. For example, in certain preferred 

25 embodiments wherein the epitope is an epitope of the DISC 1 polypeptide set forth in FIG. 
2 (SEQ ID NO:2), a DISCI polypeptide of the invention may comprise an amino acid 
sequence corresponding to a sequence of at least 5, at least 10, at least 15, at least 20, at least 
25 or at least 50 amino acid residues of the sequence set forth in FIG. 2 (SEQ ID NO:2). 
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The DISCI polypeptides or DISC2 RNA sequences of the invention also 
include analogs and derivatives of the full length DISCI polypeptides set forth in FIG. 2 
(SEQ ID NO:2) or the full length DISC2 RNA sequence. Analogs and derivatives of the 
DISCI polypeptides or DISC2 RNA sequences of the invention have the same or 
5 homologous characteristics of DISCI polypeptides or DISC2 RNA sequences set forth 
above. 

A DISCI chimeric fusion polypeptide can be prepared in which the DISCI 
portion of the fusion protein has one or more characteristics of the DISC 1 polypeptide. Such 
fusion polypeptides therefore represent embodiments of the DISCI polypeptides of the 

1 0 present invention. Exemplary DI SC 1 fusion polypeptides include ones which comprise a full 
length, derivative or truncated DISCI amino acid sequence, as well as fusions which 
comprise a fragment of an OSCAR polypeptide sequence (e g. , a fragment corresponding to 
an epitope or to one or more domains or regions). Such fusion polypeptides may also 
comprise the amino acid sequence of a marker polypeptide; for example FLAG, a histidine 

15 tag, glutathione S-transferase (GST) or hemaglutinin. In other embodiments, a DISCI 
polypeptide may be expressed with a bacterial protein such as (3-galactosidase. Additionally, 
DISCI fusion polypeptides may comprise amino acid sequences that increase solubility of 
the polypeptide, such as a thioreductase amino acid sequence or the sequence of one or more 
immunoglobulin proteins (e.g., IgGl or IgG2). 

20 DISC 1 analogs or variants can also be made by altering the wild-type DISC 1 

polypeptide sequence, e.g., by introducing one or more amino acid residue substitutions, 
additions or deletions. In one embodiment, such altered polypeptides are functionally similar 
molecules (i.e., molecules that perform one or more DISCI functions or have one or more 
DISCI bioactivities). Thus, in a specific embodiment, an analog of a DISCI polypeptide is 

25 a function-conservative variant, as the term is defined supra. 

In a preferred embodiment, an analog of a DISCI polypeptide is associated 
with a neuropsychiatric disorder, such as schizophrenia, schizoaffective disorder, bipolar 
affective disorder, unipolar affective disorder and adolescent conduct disorder. For example, 
the Examples infra describe various mutations to the DISCI gene which encode an analog 
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DISCI gene product. Such analog DISCI gene products represent exemplary, specific 
embodiments of the analog DISCI gene products of the present invention. 

Amino acid residues, other than the ones that are specifically identified herein 
as being conserved, may differ among variants of a protein or polypeptide. Accordingly, the 
5 percentage of protein or amino acid sequence similarity between any two DISCI 
polypeptides of similar function may vary. Typically, the percentage of protein or amino 
acid sequence similarity between different DISCI polypeptide analogs or variants may be 
from 70% to 99%, as determined according to an alignment scheme such as the Cluster 
Method and/or the MEGALIGN algorithm. Analogs and variants of a DISCI polypeptide 

10 therefore also include polypeptide that have at least 50%, preferably at least 75%, more 
preferably at least 85% and still more preferably at least 90% amino acid sequence identity 
as determined, e.g. , by BLAST or FASTA algorithms. In one embodiment, such analogs and 
variants of a DISCI polypeptide are function-conservative variants which have the same or 
similar properties, functions or bioactivities as the native polypeptide to which they are 

15 compared. In another, preferred embodiment, such analogs and variants of a DISCI 
polypeptide are ones which are associated with a neuropsychiatric disorder, such as 
schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar affective disorder 
and adolescent conduct disorder. It is further noted that the analogs of the DISCI 
polypeptides of the present invention include, not only homologs and variants of the full 

20 length DISC 1 polypeptides of the present invention (e.g. , variants of a DISC 1 polypeptide 
comprising the sequence set forth in FIG. 2 and in SEQ ID NO:2), but also include variants 
of modified DISCI polypeptides (e.g., truncations and deletions) and of fragments (e.g., 
corresponding to particular domains, regions or epitopes) of a full length DISC 1 polypeptide. 

In yet other embodiment, an analog of a DISCI polypeptide is an allelic 

25 variant or mutant of a DISCI polypeptide. The term allelic variant and mutant, when used 
to describe a polypeptide, refers to a polypeptide encoded by an allelic variant or mutant 
gene. Thus, the allelic variant and mutant DISCI polypeptides of the invention are 
polypeptides encoded by allelic variants of mutants of the DISCI nucleic acid molecules of 
the present invention. 
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In yet other embodiments, an analog of a DISC 1 polypeptide is a substantially 
homologous polypeptide from the same species (e.g., an allelic variant) or from another 
species (e.g. , an orthologous polypeptide); preferably from another mammalian species such 
as mouse, rat, rabbit, hamster, guinea pig, primate (e.g., monkey or human), cats, dogs, 
5 sheep, goats, pigs, horses, cows, etc. However, DISCI homologous of the invention may be 
from any species, including chickens, Xenopus, yeast (e.g. , Saccharomyces cerevisiae), and 
bacteria (e.g., E. coli) to name a few. 

In a specific embodiment, two polypeptide sequence are "substantially 
homologous" or "substantially similar" when the polypeptides are at least 35-40% similar 

10 as determined by one of the algorithms disclosed herein, preferably at least about 60% 
similar, and more preferably at least about 90 or 95% similar in one or more highly 
conserved domains or, for allelic variants, across the entire amino acid sequence. 

In other embodiments, variants of a DISCI polypeptide (including analogs, 
orthologs and homologs) are polypeptides encoded by nucleic acid molecules that hybridize 

15 to the complement of a nucleic acid molecule encoding a DISCI polypeptide; e.g., in a 
Southern hybridization experiment under defined conditions. For example, in a particular 
embodiment analogs and/or homologs of a DISCI polypeptide comprise amino acid 
sequences encoded by nucleic acid molecules that hybridize to a complement of a DISCI 
nucleic acid sequence, such as the coding sequence of the DISCI cDNA sequence set froth 

20 in FIGS. 1A-C (SEQ ID NO:l), under highly stringent hybridization conditions that 
comprise 50% formamide and 5X or 6X SSC. in other embodiments, the analogs and/or 
homologs of a DISCI polypeptide may comprise amino acid sequences encoded by nucleic 
acid molecules that hybridize to a complement of a DISCI nucleic acid sequence (e.g. , the 
complement of the coding sequence of the DISCI cDNA sequence set forth in FIGS. 1A-C 

25 and in SEQ ID NO:l) under moderately stringent hybridization conditions (i.e., 40% 
formamide with 5X or 6X SSC), or under low stringency conditions (e.g. , in 5X SSC, 0. 1 % 
SDS, 0.25% milk, no formamide, 30% formamide, 5X SSC or 0.5% SDS). 

In still other embodiments, variants (including analogs, homologs and 
orthologs) of a DISCI polypeptide can also be identified by isolating variant DISCI genes, 
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e.g., by PCR using degenerate oligonucleotide primers designed on the basis of amino acid 
sequence of the DISCI polypeptide (see, supra). 

Derivatives of the DISCI polypeptides of the invention further include, but 
are by no means limited to, phosphorylated DISC 1 , myristylated DISCI, methylated DISC 1 
5 and other DISCI polypeptides that are chemically modified. DISCI polypeptides of the 
invention may include labeled variants; form example, radio-labeled with iodine or 
phosphorous (see, e.g. , EP 372707B) or other detectable molecule such as, but by no means 
limited to, biotin, a fluorescent dye (e.g. , Cy5 or Cy3), a chelating group complexed with a 
metal ion, a chromophore or fluorophore, a gold colloid, a particle such as a latex bead, or 

10 attached to a water soluble polymer. 

Chemical modification of a biologically active component or components of 
DISCI (or DISC2) nucleic acids or polypeptides may provide additional advantages under 
certain circumstances. See, for example, U.S. Patent No. 4,1 79,337 issued December 1 8, 
1970 to Davis et al. Also, for a review see Abuchowski et aL, in Enzymes as Drugs (J.S. 

15 Holcerberg and J. Roberts, eds. 1981), pp. 367-383. A review article describing protein 
modification and fusion proteins is found in Francis, Focus on Growth Factors 1 992, 3 :4-l 0, 
Mediscript: Mountview Court, Friem BarneL Lane, London N20, OLD, UK. 

Polymorphisms in the DISCI polypeptide 

20 The present invention provides isolated polymorphic DISCI polypeptides, 

such as DISCI polypeptides which are encoded by specific allelic variants of DISCI genes, 
including those identified herein. Accordingly, preferred DISCI polypeptides of the 
invention have an amino acid sequence which differs from SEQ ID NO: 2. In one 
embodiment, the DISC 1 polypeptides are isolated from, or otherwise substantially free of 

25 other cellular proteins. The term "substantially free of other cellular proteins" (also referred 
to herein as "contaminating proteins") or "substantially pure or purified preparations" are 
defined as encompassing preparations of DISCI polypeptides having less than about 20% 
(by dry weight) contaminating protein, and preferably having less than about 5% 
contaminating protein. It will be appreciated that functional forms of the subject 
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polypeptides can be prepared, for the first time, as purified preparations by using a cloned 
gene as described herein. 

Preferred DISC 1 proteins of the invention have an amino acid sequence which 
is at least about 60%, 70%, 80%, 85%, 90%, or 95% identical or homologous to an amino 
5 acid sequence of SEQ ID NO: 2. Even more preferred DISCI proteins comprise an amino 
acid sequence which is at least about 97, 98, or 99% homologous or identical to an amino 
acid sequence of SEQ ID NO:2. Such proteins can be recombinant proteins, and can be, e.g., 
produced in vitro from nucleic acids comprising a specific allele of a DISCI polymorphic 
region. For example, recombinant polypeptides preferred by the present invention can be 

10 encoded by a nucleic acid, which is at least 85% homologous and more preferably 90% 
homologous and most preferably 95 % homologous with a nucleotide sequence set forth in 
SEQ ID NO: 1, and comprises an allele of a polymorphic region that differs from that set 
forth in SEQ ID NO: 1 . Polypeptides which are encoded by a nucleic acid that is at least 
about 98-99% homologous with the sequence of SEQ ID NO: 1, and comprise an allele of 

15 a polymorphic region that differs from that set forth in SEQ ID NO: 1 are also within the 
scope of the invention. 

In a preferred embodiment, a DISCI protein of the present invention is a 
mammalian DISCI protein. In an even more preferred embodiment, the DISCI protein is 
a human protein, such as a DISCI polypeptide comprising an amino acid sequence from 

20 SEQ ID NO: 2 in which amino acid 607 is an phenylalanine residue and/or amino acid 704 
is a cysteine. 

DISC 1 polypeptides are preferably capable of functioning in one of either role 
of an agonist or antagonist of at least one biological activity of a wild-type ("authentic") 
DISCI protein of the appended sequence listing. The term f, evolutionarily related to", with 
25 respect to amino acid sequences of DISC 1 proteins, refers to both polypeptides having amino 
acid sequences which have arisen naturally, and also to mutational variants of human DISC 1 
polypeptides which are derived, for example, by combinatorial mutagenesis. 
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Full length proteins or fragments corresponding to one or more particular 
motifs and/or domains or to arbitrary sizes, for example, at least 5, 10, 25, 50, 75 and 100, 
amino acids in length are within the scope of the present invention. 

Isolated peptidyl portions of DISCI proteins can be obtained by screening 
5 peptides recombinantly produced from the corresponding fragment of the nucleic acid 
encoding such peptides. In addition, fragments can be chemically synthesized using 
techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc 
chemistry. For example, a DISCI polypeptide of the present invention may be arbitrarily 
divided into fragments of desired length with no oveilap of the fragments, or preferably 
10 divided into overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments 
which can function as either agonists or antagonists of a wild-type (e.g., "authentic") DISCI 
protein. 

In general, polypeptides refeired to herein as having an activity (e.g., are 
15 "bioactive") of a DISCI protein are defined as polypeptides which mimic or antagonize all 
or a portion of the biological/biochemical activities of a DISCI protein having SEQ ID NO: 
2, such as the ability to bind a substrate or ligand. Other biological activities of the subject 
DISCI proteins will be reasonably apparent to those skilled in the art. According to the 
present invention, a polypeptide has biological activity if it is a specific agonist or antagonist 
20 of a naturally-occurring form of a DISCI protein. 

Assays for determining whether a DISCI protein or variant thereof has one 
or more biological activities are well known in the art. 

Other preferred proteins of the invention are those encoded by the nucleic 
acids set forth in the section pertaining to nucleic acids of the invention. In particular, the 
25 invention provides fusion proteins, e.g., DISCI -immunoglobulin fusion proteins. Such 
fusion proteins can provide, e.g., enhanced stability and solubility of DISCI proteins and 
may thus be useful in therapy. Fusion proteins can also be used to produce an immunogenic 
fragment of a DISCI protein. For example, the VP6 capsid protein of rotavirus can be used 
as an immunologic carrier protein for portions of the DISCI polypeptide, either in the 
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monomeric form or in the form of a viral particle. The nucleic acid sequences corresponding 
to the portion of a subject DISCI protein to which antibodies are to be raised can be 
incorporated into a fusion gene construct which includes coding sequences for a late vaccinia 
virus structural protein to produce a set of recombinant viruses expressing fusion proteins 
5 comprising DISCI epitopes as part of the virion. It has been demonstrated with the use of 
immunogenic fusion proteins utilizing the Hepatitis B surface antigen fusion proteins that 
recombinant Hepatitis B virions can be utilized in this role as well. Similarly, chimeric 
constructs coding for fusion proteins containing a portion of a DISCI protein and the 
poliovirus capsid protein can be created to enhance immunogenicity of the set of polypeptide 

10 antigens (see, for example, EP Publication No: 0259149; and Evans et al (1989) Nature 
339:385; Huang etal (1988) J. Virol. 62:3855; and Schlienger etal (1992) J. Virol. 66:2). 

The Multiple antigen peptide system for peptide-based immunization can also 
be utilized to generate an immunogen, wherein a desired portion of a DISCI polypeptide is 
obtained directly from organo-chemical synthesis of the peptide onto an oligomeric 

15 branching lysine core (see, for example, Posnett etal (1988) JBC 263:1719 and Nardelli et 
al (1992) J. Immunol 148:914). Antigenic determinants of DISCI proteins can also be 
expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is 
widely appreciated that fusion proteins can also facilitate the expression of proteins, and 

20 accordingly, can be used in the expression of the DISCI polypeptides of the present 
invention. For example, DISCI polypeptides can be generated as glutathione-S-transferase 
(GST-fusion) proteins. Such GST-fusion proteins can enable easy purification of the DISC 1 
polypeptide, as for example by the use of glutathione-derivatized matrices (see, for example, 
Current Protocols in Molecular Biology, eds. Ausubel et al (N.Y.: John Wiley & Sons, 

25 1991)). 

The present invention further pertains to methods of producing the subject 
DISC 1 polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding the subject polypeptides can be cultured under 
appropriate conditions to allow expression of the peptide to occur. Suitable media for cell 
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culture are well known in the art. The recombinant DISC 1 polypeptide can be isolated from 
cell culture medium, host cells, or both using techniques known in the art for purifying 
proteins including ion-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for 
5 such peptide. In a preferred embodiment, the recombinant DISCI polypeptide is a fusion 
protein containing a domain which facilitates its purification, such as GST fusion protein. 

Moreover, it will be generally appreciated that, under certain circumstances, 
it may be advantageous to provide homologs of one of the subject DISCI polypeptides 
which function in a limited capacity as one of either a DISCI agonist (mimetic) or a DISCI 

10 antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protein. Thus, specific biological effects can be elicited by 
treatment with a homolog of limited function, and with fewer side effects relative to 
treatment with agonists or antagonists which are directed to all of the biological activities of 
naturally occurring forms of DISCI proteins. 

15 Homologs of each of the subject DISCI proteins can be generated by 

mutagenesis, such as by discrete point mutation(s), or by truncation. For instance, mutation 
can give rise to homologs which retain substantially the same, or merely a subset, of the 
biologica] activity of the DISCI polypeptide from which it was derived. Alternatively, 
antagonistic forms of the protein can be generated which are able to inhibit the function of 

20 the naturally occurring form of the protein, such as by competitively binding to a substrate 
or ligand. 

The recombinant DISCI polypeptides of the present invention also include 
homologs of DISC 1 polypeptides which differ from the DISC 1 proteins having SEQ ID NO: 
2, such as versions of those protein which are resistant to proteolytic cleavage, as for 
25 example, due to mutations which alter ubiquitination or other enzymatic targeting associated 
with the protein. 

DISCI polypeptides may also be chemically modified to create derivatives 
by forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of DISC 1 proteins 
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can be prepared by linking the chemical moieties to functional groups on amino acid side- 
chains of the protein or at the N-terminus or at the C-terminus of the polypeptide. 

Modification of the structure of the subject DISCI polypeptides can be for 
such purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf 
5 life and resistance to proteolytic degradation), or post-translational modifications (e.g., to 
alter phosphorylation pattern of protein). Such modified peptides, when designed to retain 
at least one activity of the naturally -occurring form of the protein, or to produce specific 
antagonists thereof, are considered functional equivalents of the DISCI polypeptides 
described in more detail herein. Such modified peptides can be produced, for instance, by 

1 0 amino acid substitution, deletion, or addition . The substitutional variant may be a substituted 
conserved amino acid or a substituted non-conserved amino acid. For example, it is 
reasonable to expect that an isolated replacement of a leucine with an isoleucine or valine, 
an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an amino 
acid with a structurally related amino acid (i.e. isosteric and/or isoelectric mutations) will not 

15 have a major effect on the biological activity of the resulting molecule. Conservative 
replacements are those that take place within a family of amino acids that are related in their 
side chains. Genetically encoded amino acids can be divided into four families: (1) acidic = 
aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) nonpolar = alanine, valine, 
leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar 

20 = glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine. In similar fashion, the 
amino acid repertoire can be grouped as (1 ) acidic = aspartate, glutamate; (2) basic = lysine, 
arginine histidine, (3) aliphatic = glycine, alanine, valine, leucine, isoleucine, serine, 
threonine, with serine and threonine optionally be grouped separately as aliphatic-hydroxyl; 
(4) aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine, glutamine; and 

25 (6) sulfur -containing = cysteine and methionine, (see, for example, Biochemistry, 2 nd ed., 
Ed. by L. Stryer, WH Freeman and Co.: 1981). Whether a change in the amino acid 
sequence of a peptide results in a functional DISCI homolog (e.g., functional in the sense 
that the resulting polypeptide mimics or antagonizes the wild-type form) can be readily 
determined by assessing the ability of the variant peptide to produce a response in cells in 
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a fashion similar to the wild-type protein, or competitively inhibit such a response. 
Polypeptides in which more than one replacement has taken place can readily be tested in 
the same manner. 



5 5. 4 Expression of DISCI and DISC2 Gene Products 

The nucleotide sequence coding for DISCI or DISC2, or for an antigenic 
fragment, derivative or analog thereof, for a functionally active derivative (including a 
chimeric protein) thereof, or for a mutant or variant DISCI or DISC2 (such as for one of the 
variants or mutants described in the Examples, infra) can be inserted into an appropriate 

1 0 expression vector, /. e. , a vector which contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequence. Thus, a nucleic acid encoding 
DISCI or DISC2 of the invention can be operationally associated with a promoter in an 
expression vector of the invention. Both cDNA and genomic sequences can be cloned and 
expressed under control of such regulatory sequences. Such vectors can be used to express 

1 5 functional or functionally inactivated DISC 1 and DISC2 gene products (for example DISC 1 
and DISC2 RNA or DISCI polypeptides). These include, for example, a DISCI or DISC2 
nucleic acid having one or more mutations or polymorphisms that are associated with a 
neuropsychiatric disorder, such as DISCI nucleic acids having one or more of the 
polymorphisms specified in TABLE 5 and in TABLE 6A of the Examples, infra. In 

20 addition, nucleic acids that encode a variant DISCI polypeptide, for example a variant 
DISCI polypeptide associated with a neuropsychiatric disorder and/or having one or more 
of the amino acid substitutions disclosed in TABLE 6B of the Examples, infra, may be 
cloned and expressed according to the methods described here. 

The necessary transcriptional and translational signals can be provided on a 

25 recombinant expression vector. 

Potential host- vector systems include but are not limited to mammalian cell 
systems transfected with expression plasmids or infected with virus {e.g., vaccinia virus, 
adenovirus, adeno-associated virus, herpes virus, etc.); insect cell systems infected with virus 
{e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
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transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host-vector 
system utilized, any one of a number of suitable transcription and translation elements may 
be used. 

5 Expression of DISC 1 or DISC2 may be controlled by any promoter/enhancer 

element known in the art, but these regulatory elements must be functional in the host 
selected for expression. Promoters which may be used to control DISCI or DISC2 gene 
expression include, but are not limited to, cytomegalovirus (CMV) promoter (U.S. Patent 
Nos. 5,385,839 and No. 5,1 68,062), the SV40 early promoter region (Benoist and Chambon, 

10 1981, Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous 
sarcoma virus (Yamamoto, et al., Cell 22:787-797, 1980), the herpes thymidine kinase 
promoter (Wagner et al, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445, 1981), the regulatory 
sequences of the metallothionein gene (Brinster et al. , Nature 296:39-42, 1 982); prokaryotic 
expression vectors such as the b-lactamase promoter (Villa-Komaroff, et al, Proc. Natl. 

15 Acad. Sci. U.S.A. 75:3727-3731, 1978), or the tac promoter (DeBoer, et al, Proc. Natl 
Acad. Sci. U.S.A. 80:21-25, 1983); see also "Useful proteins from recombinant bacteria" in 
Scientific American, 242:74-94, 1980; promoter elements from yeast or other fungi such as 
the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol 
kinase) promoter, alkaline phosphatase promoter; and transcriptional control regions that 

20 exhibit hematopoietic tissue specificity, in particular: beta-globin gene control region which 
is active in myeloid cells (Mogram et al, Nature 315:338-340, 1985; Kollias et al, Cell 
46:89-94, 1986), hematopoietic stem cell differentiation factor promoters, erythropoietin 
receptor promoter (Maouche et al, Blood, 15:2557, 1991), etc. 

Indeed, any type of plasmid, cosmid, YAC or viral vector may be used to 

25 prepare a recombinant nucleic acid construct which can be introduced to a cell, or to tissue, 
where expression of an a DISCI or a DISC2 gene product is desired. Alternatively, wherein 
expression of a recombinant DISCI or DISC2 gene product in a particular type of cell or 
tissue is desired, viral vectors that selectively infect the desired cell type or tissue type can 
be used. 



In another embodiment, the invention provides methods for expressing DISC 1 
polypeptides or DISC2 RNA by using a non-endogenous promoter to control expression of 
an endogenous DISCI or DISC2 gene within a cell. An endogenous DISCI or DISC2 gene 
within a cell is a gene of the present invention which is ordinarily (i.e., naturally) found in 
5 the genome of that cell. A non-endogenous promoter, however, is a promoter or other 
nucleotide sequence that may be used to control expression of a gene but is not ordinarily or 
naturally associated with the endogenous DISCI or DISC2 gene. As an example, methods 
of homologous recombination may be employed (preferably using non-protein encoding 
DISCI or DISC2 nucleic acid sequences of the invention) to insert an amplifiable gene or 

10 other regulatory sequence in the proximity of an endogenous DISCI or DISC2 gene. The 
inserted sequence may then be used, e.g., to provide for higher levels of DISCI or DISC2 
gene expression than normally occurs in that cell, or to overcome one or more mutations in 
the endogenous DISCI or DISC2 regulatory sequences which prevent norma] levels of 
DISCI or DISC2 gene expression. Such methods of homologous recombination are well 

15 known in the art. See, for example, International Patent Publication No. WO 91/06666, 
published May 16, 1991 by Skoultchi; International Patent Publication No. WO 91/099555, 
published July 11, 1991 by Chappel; and International Patent Publication No. WO 
90/14092, published November 29, ) 990 by Kucherlapati and Campbell 

Soluble forms of the protein can be obtained by collecting culture fluid, or 

20 solubilizing inclusion bodies, e.g. , by treatment with detergent, and if desired sonication or 
other mechanical processes, as described above. The solubilized or soluble protein can be 
isolated using various techniques, such as polyacrylamide gel electrophoresis (PAGE), 
isoelectric focusing, 2-dimensional gel electrophoresis, chromatography (e.g. , ion exchange, 
affinity, immunoaffinity, and sizing column chromatography), centrifugation, differential 

25 solubility, immunoprecipitation, or by any other standard technique for the purification of 
proteins. 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for example, 
may consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. 
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Suitable vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli 
plasmids col El, pCRl , pBR322, pMal-C2, pET, pGEX (Smith et al , Gene 67:3 1 -40, 1 988), 
pMB9 and their derivatives, plasmids such as RP4; phage DNAs, e.g., the numerous 
derivatives of phage 1, e.g. , NM989, and other phage DNA, e.g. , Ml 3 and filamentous single 
5 stranded phage DNA; yeast plasmids such as the 2m plasmid or derivatives thereof; vectors 
useful in eukaryotic cells, such as vectors useful in insect or mammalian cells; vectors 
derived from combinations of plasmids and phage DNAs, such as plasmids that have been 
modified to employ phage DNA or other expression control sequences; and the like. 

Preferred vectors are viral vectors, such as lentiviruses, retroviruses, herpes 

10 viruses, adenoviruses, adeno-associated viruses, vaccinia virus, baculovirus, and other 
recombinant viruses with desirable cellular tropism. Thus, a gene encoding a functional or 
mutant DISCI or DISC2 gene product or a fragment thereof (e.g., a polypeptide domain of 
a DISCI or DISC2 gene product) can be introduced in vivo, ex vivo, or in vitro using a viral 
vector or through direct introduction of DNA. Expression in targeted tissues can be effected 

15 by targeting the transgenic vector to specific cells, such as with a viral vector or a receptor 
ligand, or by using a tissue-specific promoter, or both. Targeted gene delivery is described 
in International Patent Publication WO 95/28494, published October 1995. 

According to the present invention, vectors may be specifically targeted to 
cells expressing either a DISCI or a DISC2 gene product (e.g., on their cell surface) using, 

20 for example, a DISCI - or DISC2-specific antibody (i.e., an antibody that specifically binds 
to a DISCI or a DISC2 gene product), or using a DISCI or a DISC2 binding partner (for 
example, a DIS1- or DISC2-specific ligand). Vectors may also be specifically targeted to 
cells expressing DISCI or DISC2 using fragments (e.g., peptide or polypeptide fragments) 
of a DISCI or DISC2 binding partner, particularly fragments which comprise DISCI or 

25 DISC2 binding sequence. Such methods may be used to target vectors expressing any gene 
to cells, including but not limited to, vectors that express DISC 1 or DISC2 specific antisense 
nucleic acids, or ribozymes which are specific for DISCI or DISC2. 

Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for constructing and 
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using viral vectors are known in the art (see, e.g. , Miller and Rosman, BioTechniques, 7:980- 
990, 1992). Preferably, the viral vectors are replication defective, that is, they are unable to 
replicate autonomously in the target cell. In general, the genome of the replication defective 
viral vectors which are used within the scope of the present invention lack at least one region 
5 which is necessary for the replication of the virus in the infected cell. These regions can 
either be eliminated (in whole or in part), be rendered non- functional by any technique 
known to a person skilled in the art. These techniques include the total removal, substitution 
(by other sequences, in particular by the inserted nucleic acid), partial deletion or addition 
of one or more bases to an essential (for replication) region. Such techniques may be 
10 performed in vitro (on the isolated DNA) or in situ, using the techniques of genetic 
manipulation or by treatment with mutagenic agents. Preferably, the replication defective 
virus retains the sequences of its genome which are necessary for encapsidating the viral 
particles. 

DNA vira] vectors include an attenuated or defective DNA virus, such as but 
15 not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV). 
adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely 
or almost entirely lack viral genes, are preferred. Defective virus is not infective aftei 
introduction into a cell Use of defective viral vectors allows for administration to cells in 
a specific, localized area, without concern that the vector can infect other cells. Thus, a 
20 specific tissue can be specifically targeted. Examples of particular vectors include, but are 
not limited to, a defective herpes virus 1 (HSV1) vector (Kaphtt et al, Molec. Cell. 
Neurosci. 2:320-330, 1991), defective herpes virus vector lacking a glyco-protein L gene 
(Patent Publication RD 371005 A), or other defective herpes virus vectors (International 
Patent Publication No. WO 94/21807, published September 29, 1994; International Patent 
25 Publication No. WO 92/05263, published April 2, 1994); an attenuated adenovirus vector, 
such as the vector described by Stratford-Perricaudet^a/. (J. Clin. Invest. 90:626-630, 1992; 
see also La Salle et al, Science 259:988-990, 1993); and a defective adeno-associated virus 
vector (Samulski et al, J. Virol. 61:3096-3101, 1987; Samulski et al, J. Virol. 63:3822- 
3828, 1989; Lebkowski etaL, Mol. Cell. Biol. 8:3988-3996, 1988). 
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Various companies produce viral vectors commercially, including but by no 
means limited to Avigen, Inc. (Alameda, CA; AAV vectors), Cell Genesys (Foster City, CA; 
retroviral, adenoviral, AAV vectors, and lentiviral vectors), Clontech (retroviral and 
baculoviral vectors), Genovo, Inc. (Sharon Hill, PA; adenoviral and AAV vectors), Genvec 
5 (adenoviral vectors), IntroGene (Leiden, Netherlands; adenoviral vectors), Molecular 
Medicine (retroviral, adenoviral, AAV, and herpes viral vectors), Norgen (adenoviral 
vectors), Oxford BioMedica (Oxford, United Kingdom; lentiviral vectors), and Transgene 
(Strasbourg, France; adenoviral, vaccinia, retroviral, and lentiviral vectors). 

In another embodiment, the vector can be introduced in vivo by lipofection, 

10 as naked DNA, or with other transfection facilitating agents (peptides, polymers, etc.). 
Synthetic cationic lipids can be used to prepare liposomes for in vivo transfection of a gene 
encoding a marker (Feigner, et al, Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; 
Feigner and Ringold, Science 337:387-388, 1989; see Mackey , et al , Proc. Natl. Acad. Sci. 
U.S.A. 85:8027-8031, 1988; Ulmer et al, Science 259:1745-1748, 1993). Useful lipid 

15 compounds and compositions for transfer of nucleic acids are described in International 
Patent Publications W095/18863 and W096/17823, and in U.S. Patent No. 5,459,127. 
Lipids may be chemically coupled to other molecules for the purpose of targeting (see 
, Mackey, et. al., supra). Targeted peptides, e. g. , hormones or neurotransmitters, and proteins 
such as antibodies, or non-peptide molecules could be coupled to liposomes chemically. 

20 Other molecules are also useful for facilitating transfection of a nucleic acid in vivo, such as 
a cationic oligopeptide {e.g. , International Patent Publication W095/2 1 93 1 ), peptides derived 
from DNA binding proteins {e.g., International Patent Publication WO96/25508), or a 
cationic polymer {e.g., International Patent Publication W095/21931). 

It is also possible to introduce the vector in vivo as a naked DNA plasmid. 

25 Naked DNA vectors for gene therapy can be introduced into the desired host cells by 
methods known in the art, e.g., electroporation, microinjection, cell fusion, DEAE dextran, 
calcium phosphate precipitation, use of a gene gun, or use of a DNA vector transporter (see, 
e.g., Wu etal, J. Biol. Chem. 267:963-967, 1992; Wu and Wu, J. Biol. Chem. 263:14621- 
14624, 1988; Hartmut et aL 9 Canadian Patent Application No. 2,012,31 1, filed March 15, 



1990; Williams et al ,Proc. Natl. Acad. Sci. USA 88:2726-2730, 1991). Receptor-mediated 
DNA delivery approaches can also be used (Curiel etal.,Yhxm. Gene Ther. 3:147-154, 1992; 
Wu and Wu, J. Biol. Chem. 262:4429-4432, 1987). US Patent Nos. 5,580,859 and 5,589,466 
disclose delivery of exogenous DNA sequences, free of transfection facilitating agents, in 
5 a mammal. Recently, a relatively low voltage, high efficiency in vivo DNA transfer 
technique, termed electrotransfer, has been described (Mir et al, CP. Acad. Sci., 321 :893. 
1998; WO 99/01157; WO 99/01158; WO 99/01175). 

Preferably, for in vivo administration, an appropriate immunosuppressive 
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector, to avoid 
10 immuno-deactivation of the viral vector and transfected cells. For example, 
immunosuppressive cytokines, such as interleukin-12 (IL-12), interferon-g (lFN-y), or anti- 
CD4 antibody, can be administered to block humoral or cellular immune responses to the 
viral vectors (see, e.g., Wilson, Nature Medicine, 1995). In that regard, it is advantageous 
to employ a viral vector that is engineered to express a minimal number of antigens. 

15 

5.5 Antibodies to DISCI and DISC2 

Antibodies to DISCI and DISC2 are useful, inter alia, for diagnostics and 
intracellular regulation of DISCI and DISC2 activity, as set forth below. For example, such 
antibodies may be used in the methods and compositions of this invention to diagnose and/or 

20 treat disorders, including neuropsychiatric disorders (for example, schizophrenia, 
schizoaffective disorder, bipolar affective disorder, unipolar affective disorder and adolescent 
conduct disorder) associated with either DISCI, DISC2 or both DISCI and DISC2. 

According to the invention, DISCI and DISC2 polypeptides and/or nucleic 
acids that are produced recombinantly or by chemical synthesis, as well as fragments or other 

25 derivatives or analogs thereof (including fusion proteins), may be used as an immunogen to 
generate antibodies that recognize DISCI or DISC2. Such antibodies include but are not 
limited to polyclonal, monoclonal, chimeric, single chain, Fab fragments, and an Fab 
expression library. Such an antibody is specific for human DISCI or DISC2; it may 
recognize a mutant form of DISCI or DISC2, a wild-type DISCI or DISC2, or both. 
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For example, in preferred embodiments an antibody may recognize and 
specifically bind to a mutant or variant DISCI or DISC2. Such an antibody may recognize 
and specifically bind to a DISC 1 or DISC2 gene product {e.g. , a polypeptide or nucleic acid) 
encoded by a DISC 1 or DISC2 nucleic acid that has at least one polymorphism (e.g. , at least 
5 one of the specific DISCI and DISC2 polymorphisms described in the Examples, infra). 
Preferably, such an antibody recognizes and specifically binds to a gene product encoded by 
a variant DISCI or DISC2 nucleic acid but does not bind to a gene product encoded by a 
wild-type DISCI or DISC2 nucleic acid (e.g., a gene product encoded by any of the DISCI 
and DISC2 nucleic acids set forth in FIGS. 1 A-C and in FIGS. 3A-G, respectively, and in 

10 SEQIDNOS:! and 3). 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to OSCAR polypeptide or derivative or analog thereof. For the 
production of antibody, various host animals can be immunized by injection with the 
OSCAR polypeptide, or a derivative (e.g. , fragment or fusion protein) thereof, including but 

1 5 not limited to rabbits, mice, rats, sheep, goats, etc. In one embodiment, a DISC 1 or DISC2 
polypeptide or fragment thereof can be conjugated to an immunogenic carrier, e.g., bovine 
serum albumin (BSA) or keyhole limpet hemocyanin (KLH). Various adjuvants may be 
used to increase the immunological response, depending on the host species, including but 
not limited to Freund's (complete and incomplete), mineral gels such as aluminum 

20 hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

For preparation of monoclonal antibodies directed toward the DISCI or 
DISC2 gene product, or a fragment, analog, or derivative thereof, any technique that provides 

25 for the production of antibody molecules by continuous cell lines in culture may be used. 
These include but are not limited to the hybridoma technique originally developed by Kohler 
and Milstein (Nature 1975, 256:495-497), as well as the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al , Immunology Today 1 983, 4:72; Cote et al , Proc. Natl. 
Acad. Sci. U.S.A. 1983, 80:2026-2030), and the EBV-hybridoma technique to produce 



human monoclonal antibodies (Cole et ah , in Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc., 1985, pp. 77-96). In an additional embodiment of the invention, 
monoclonal antibodies can be produced in germ-free animals (International Patent 
Publication No. WO 89/12690). In fact, according to the invention, techniques developed 
5 for the production of "chimeric antibodies" (Morrison et al, J. Bacteriol. 1984, 159:870; 
Neuberger et aL, Nature 1984, 312:604-608; Takeda et al, Nature 1985, 314:452-454) by 
splicing the genes from a mouse antibody molecule specific for an OSCAR polypeptide 
together with genes from a human antibody molecule of appropriate biological activity can 
be used; such antibodies are within the scope of this invention. Such human or humanized 
10 chimeric antibodies are preferred for use in therapy of human diseases or disorders 
(described infra), since the human or humanized antibodies are much less likely than 
xenogenic antibodies to induce an immune response, in particular an allergic response, 
themselves. 

Antibody fragments which contain the idiotype of the antibody molecule can 
1 5 be generated by known techniques. For example, such fragments include but are not limited 
to : the F(ab f ) ? fragment which can be produced by pepsin digestion of the antibody molecule; 
the Fab' fragments which can be generated by reducing the disulfide bridges of the F(ab') 2 
fragment, and the Fab fragments which can be generated by treating the antibody molecule 
with papain and a reducing agent. 
20 According to the invention, techniques described for the production of single 

chain antibodies (U.S. Patent Nos. 5,476,786, 5,132,405, and 4,946,778) can be adapted to 
produce DISCI- or DISC2-specific single chain antibodies. An additional embodiment of 
the invention utilizes the techniques described for the construction of Fab expression libraries 
(Huse et al, Science 1989, 246:1275-1281) to allow rapid and easy identification of 
25 monoclonal Fab fragments with the desired specificity for DISCI or DISC2, or for its 
derivatives or analogs. 

In the production and use of antibodies, screening for or testing with the 
desired antibody can be accomplished by techniques known in the art, e.g., 
radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), "sandwich" 
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immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope 
labels, for example), western blots, precipitation reactions, agglutination assays (e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, 
5 immunofluorescence assays, protein A assays, and Immunoelectrophoresis assays, etc. In 
one embodiment, antibody binding is detected by detecting a label on the primary antibody. 
In another embodiment, the primary antibody is detected by detecting binding of a secondary 
antibody or reagent to the primary antibody. In a further embodiment, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an 

10 immunoassay and are within the scope of the present invention. For example, to select 
antibodies which recognize a specific epitope of a DISCI or DISC2 gene product, one may 
assay generated hybridomas for a product which binds to a DISCI or DISC2 fragment 
containing such epitope. For selection of an antibody specific to either a DISC 1 or a DISC2 
gene product from a particular species of animal, one can select on the basis of positive 

15 binding with a DISCI or a DISC2 gene product expressed by or isolated from cells of that 
species of animal. 

The foregoing antibodies can be used in methods known in the art relating to 
the localization and activity of a DISCI or a DISC2 gene product, such as a polypeptide. For 
example, the antibodies may be used for Western blotting, imaging DISC 1 or DISC2 in situ, 

20 measuring levels thereof in appropriate physiological samples, etc. using any of the detection 
techniques mentioned above or known in the art. Such antibodies can also be used in assays 
for ligand binding, e.g., as described in US Patent No. 5,679,582. Antibody binding 
generally occurs most readily under physiological conditions, e.g., pH of between about 7 
and 8, and physiological ionic strength. The presence of a carrier protein in the buffer 

25 solutions stabilizes the assays. While there is some tolerance of perturbation of optimal 
conditions, e.g., increasing or decreasing ionic strength, temperature, or pH, or adding 
detergents or chaotropic salts, such perturbations will decrease binding stability. 
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In still other embodiments, anti-DISCl and/or anti-DISC2 antibodies may 
also be used to isolate cells which express either DISCI or DISC2, by panning or related 
immunoadsorption techniques. 

In a specific embodiment, antibodies that agonize or antagonize the activity 
5 of DISCI or DISC2 can be generated. In particular, intracellular single chain Fv antibodies 
can be used to regulate (inhibit) DISC 1 or DISC2 activity (Marasco et al , Proc. Natl. Acad. 
Sci. U.S.A. 1993, 90:7884-7893; Chen., Mol. Med. Today 1997, 3:160-167; Spitz et al, 
Anticancer Res. 1996, 16:3415-22; Indolfi efa/.,Nat. Med. 1996,2:634-635; Kijmaefa/., 
Pharmacol Ther. 1995, 68:247-267). Such antibodies can be tested using the assays 
1 0 described infra for identifying ligands. 



5. 6 In Vivo Testing Using Transgenic Animals 

Transgenic animals, including transgenic mammals, may be prepared for 
evaluating the molecular mechanism(s) of DISC 1 or DISC2 and, particularly, for evaluating 

1 5 the molecular mechanism(s) of disease and disorders, for example neuropsychiatric disorders 
{e.g., schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar affective 
disorder and adolescent conduct disorder), that are associated with DISCI or DISC2. Such 
animals provide excellent models for screening and/or testing dmg candidates for such 
disorders. Thus, human DISC 1 and/or DISC2 "knock-in" animals, including human DISC 1 

20 and/or DISC2 "knock-in" mammals, can be prepared for evaluating the molecular biology 
to this system in greater detail than is possible with human subjects. It is also possible to 
evaluate compounds or diseases in "knockout" animals, e.g. , to identify a compound that can 
compensate for a defect in DISC 1 or DISC2 activity. Both technologies permit manipulation 
of single units of genetic information in their natural position in a cell genome and to 

25 examine the results of that manipulation in the background of a terminally differentiated 
organism. Trangenic mammals can be prepared by any method, including but not limited 
to modification of embryonic stem (ES) cells and heteronuclear injecion into blast cells. 

A "knock-in" animal is an animal (e.g. , a mammal such as a mouse) in which 
an endogenous gene is substituted with a heterologous gene (Roamer et al, New Biol. 1 99 1 , 
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3:331). Preferably, the heterologous gene is "knocked-in" to a locus of interest, either the 
subject of evaluation (in which case the gene may be a reporter gene; see Elegant et al , Proc. 
Natl. Acad. Sci. USA 1998, 95:11897) of expression or function of a homologous gene, 
thereby linking the heterologous gene expression to transcription from the appropriate 
5 promoter. This can be achieved by homologous recombination, transposon (Westphal and 
Leder, Curr Biol 1 997, 7:530), using mutant recombination sites (Araki et al , Nucleic Acids 
Res 1997, 25:868) or PCR (Zhang and Henderson, Biotechniques 1998, 25:784). 

A "knockout animal" is an animal (e.g., a mammal such as a mouse) that 
contains within its genome a specific gene that has been inactivated by the method of gene 

10 targeting {see, e.g., US Patents Nos. 5,777,195 and 5,616,491). A knockout animal includes 
both a heterozygote knockout (i.e., one defective allele and one wild-type allele) and a 
homozygous mutant. Preparation of a knockout animal requires first introducing a nucleic 
acid construct that will be used to suppress expression of a particular gene into an 
undifferentiated cell type termed an embryonic stem cell. This cell is then injected into a 

1 5 mammalian embryo. In preferred embodiments for which the knockout animal is a mammal, 
a mammalian embryo with an integrated cell is then implanted into a foster mother for the 
duration of gestation. Zhou, ex al. (Genes and Development, 1995, 9:2623-34) describes 
PPCA knock-out mice. 

The term "knockout" refers to partial or complete suppression of the 

20 expression of at least a portion of a protein encoded by an endogenous DNA sequence in a 
cell. The term "knockout construct" refers to a nucleic acid sequence that is designed to 
decrease or suppress expression of a protein encoded by endogenous DNA sequences in a 
cell. The nucleic acid sequence used as the knockout construct is typically comprised of: (1) 
DNA from some portion of the gene (exon sequence, intron sequence, and/or promoter 

25 sequence) to be suppressed; and (2) a marker sequence used to detect the presence of the 
knockout construct in the cell. The knockout construct is inserted into a cell, and integrates 
with the genomic DNA of the cell in such a position so as to prevent or interrupt 
transcription of the native DNA sequence. Such insertion usually occurs by homologous 
recombination (i.e., regions of the knockout construct that are homologous to endogenous 
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DNA sequences hybridize to each other when the knockout construct is inserted into the cell 
and recombine so that the knockout construct is incorporated into the corresponding position 
of the endogenous DNA). The knockout construct nucleic acid sequence may comprise: (1) 
a full or partial sequence of one or more exons and/or introns of the gene to be suppressed; 
5 (2) a full or partial promoter sequence of the gene to be suppressed; or (3) combinations 
thereof. Typically, the knockout construct is inserted into an embryonic stem cell (ES cell) 
and is integrated into the ES cell genomic DNA, usually by the process of homologous 
recombination. This ES cell is then inj ected into, and integrates with, the developing embryo. 

1 0 The phrases "disruption of the gene" and "gene disruption" refer to insertion 

of a nucleic acid sequence into one region of the native DNA sequence (usually one or more 
exons) and/or the promoter region of a gene so as to decrease or prevent expression of that 
gene in the cell as compared to the wild-type or naturally occurring sequence of the gene. By 
way of example, a nucleic acid construct can be prepared containing a DNA sequence 

15 encoding an antibiotic resistance gene which is inserted into the DNA sequence that is 
complementary to the DNA sequence (promoter and/or coding region) to be disrupted. When 
this nucleic acid construct is then transfected into a cell, the construct will integrate into the 
genomic DNA Thus, many progeny of the cell will no longer express the gene at least in 
some cells, or will express it at a decreased level, as the DNA is now disrupted by the 

20 antibiotic resistance gene. 

Generally, for homologous recombination, the DNA will be at least about 1 
kilobase (kb) in length and preferably 3-4 kb in length, thereby providing sufficient 
complementary sequence for recombination when the knockout construct is introduced into 
the genomic DNA of the ES cell (discussed below). 

25 Included within the scope of this invention is an animal, preferably a mammal 

(e.g., a mouse) in which two or more genes have been knocked out or knocked in, or both. 
Such animals can be generated by repeating the procedures set forth herein for generating 
each knockout construct, or by breeding two animals, each with a single gene knocked out, 
to each other, and screening for those with the double knockout genotype. 
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Regulated knockout animals can be prepared using various systems, such as 
the tet-repressor system (see US Patent No. 5,654,168) or the Cre-Lox system (see US 
Patents No. 4,959,317 and No. 5,801,030). 

In another series of embodiments, transgenic animals are created in which: 
5 (i) a human DISCI and/or DISC2 gene(s) is(are) stably inserted into the genome of the 
transgenic animal; and/or (ii) the endogenous DISC 1 and/or DISC2 genes are inactivated and 
replaced with their human counterparts (see, e.g., Coffman, Semin. Nephrol. 1997, 17:404; 
Esther et al 9 Lab. Invest 1996, 74:953; Murakami et aL, Blood Press. Suppl. 1996, 2:36). 
In one aspect of these embodiments, the human DISCI and/or DISC2 genes inserted into 

1 0 and/or expressed by the transgenic animal comprise a wild-type DISC 1 or DISC2 gene. For 
example, the wild-type human DISCI gene may be a gene that encodes a polypeptide having 
the amino acid sequence set forth in FIG. 2 (SEQ ID NO:2). The wild-type human DISC2 
gene may be a gene that encodes a nucleic acid gene product having the sequence shown in 
FIGS. 3A-G (SEQ ID NO:3). In another aspect of these embodiments, the human DISCI 

15 and/or DISC2 genes inserted into and/or expressed by the transgenic animal comprise a 
mutant or variant DISCI orDISC2gene. For example, a DISCI or DISC2 gene having one 
or more of the polymorphisms described in the Examples infra may be inserted into and/or 
expressed by a transgenic animal of the invention. In a particularly preferred aspect of ihese 
embodiments, the polymorphism or mutation is one that is associated with a neuropsychiatric 

20 disorder such as schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar 
affective disorder and adolescent conduct disorder. 

Such transgenic animals can be treated with candidate compounds and 
monitored for neuronal development, neurodegeneration, or efficacy of a candidate 
therapeutic compound. 

25 

5.7 A pplications and Uses 
Described herein are various applications and uses for DISCI and DISC2 
gene sequences (including fragments of full length DISCI and DISC2 gene sequences), 
DISCI and DISC2 polypeptides (including fragments of full length DISCI and DISC2 
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proteins, as well as DISCI and DISC2 fusion polypeptides) and of antibodies directed 
against DISCI or DISC2 nucleic acids, and antibodies directed against DISCI or DISC2 
polypeptides (including fragments of full length DISCI and DISC2 genes and proteins). 
Such applications may include, for example, both prognostic and diagnostic applications for 
5 evaluating neuropsychiatric disorders (for example, schizophrenia, schizoaffective disorder, 
bipolar affective disorder, unipolar affective disorder and adolescent conduct disorder) 
associated with a DISCI or DISC2 gene and/or with a DISCI or DISC2 gene product or 
polypeptide, including the identification of subjects having such a disorder or having a 
predisposition to such a disorder. Additionally, such applications may include methods for 

10 treating disorders associated with a DISCI or DISC2 gene, with a DISCI or DISC2 gene 
product, or with DISCI or DISC2 polypeptide, as well as screening methods to identify 
compounds (including natural ligands and other cellular compounds) that modulate the 
synthesis, expression or activity of either a DISCI or DISC2 gene, a DISCI or DISC2 gene 
product, a DISCI or DISC2 polypeptide, or a combination thereof 

1 5 As explained supra, the DISC 1 and D1SC2 genes and their gene products are 

associated with neuropsychiatric disorders such as schizophrenia, schizoaffective disorder, 
bipolar disorder, unipolar affective disorder and adolescent conduct disorder. In addition, 
the Examples infra provide novel mutations and polymorphisms in the DISCI and DISC2 
genes which are associated with such neuropsychiatric disorders. The Examples also 

20 describe variant DISCI and DISC2 gene products that are encoded by these altered DISCI 
and DISC2 genes and therefore are also associated with such neuropsychiatric disorders. 

Accordingly, in preferred embodiments the DISC 1 and DISC2 nucleic acids 
and polypeptides of the invention, as well as antibodies directed against such DISCI and 
DISC2 nucleic acids and polypeptides, may be used: (1) in prognostic and diagnostic 

25 applications to identify individuals having a neuropsychiatric disorder (for example, 
schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar affective disorder 
and adolescent conduct disorder) or having a predisposition to such a neuropsychiatric 
disorder; and (2) in screening methods for identifying compounds (including natural ligands 
and other cellular compounds, as well as synthetic chemical compounds) that modulate 
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activity of DISC 1 and/or DISC2 genes or gene products and may be used, therefore, to treat 
aneuropsychiatric disorder such as schizophrenia, schizoaffective disorder, bipolar affective 
disorder, unipolar affective disorder and adolescent conduct disorder. 



5 Prognostic and diagnostic assays 

The present methods provide means for determining if a subject has 
(diagnostic) or is at risk of developing (prognostic) a disease, condition or disorder that is 
associated with a DISCI or DISC2 allele, e.g., neuropsychiatric disorders such as 
schizophrenia, ADD, schizoaffective disorder, BAD, unipolar affective disorder, and 
10 adolescent conduct disorder, or a neuropsychiatric disease or disorder/disorders resulting 
therefrom. 

The present invention provides methods for determining the molecular 
structure of a DISCI or DISC2 gene, such as a human DISCI or DISC2 gene, or a portion 
thereof. In one embodiment, determining the molecular structure of at least a portion of a 

15 DISC 1 or DISC2 gene comprises determining the identity of the allelic variant of at least one 
polymorphic region of the gene (determining the presence or absence of one or more of the 
allelic variants, or their complements, of SEQ ID NOs.: 1-4, and/or the sequences set foith 
in TABLE 3). A polymorphic region of the DISC 1 or DISC2 gene can be located in an exon, 
an intron, at an intron/exon border, or in the promoter of the gene. 

20 The invention provides methods for determining whether a subject has, or is 

at risk of developing, a disease or condition associated with a specific allelic variant of a 
polymorphic region of a DISCI or DISC2 gene. Such diseases can be associated with an 
abnormal neurological activity, such as, e.g., those associated with the onset of a 
neuropsychiatric disorder such as schizophrenia, schizoaffective disorder, bipolar disorder, 

25 unipolar affective disorder and adolescent conduct disorder. An aberrant DISCI protein 
level can result from an aberrant transcription or post-transcriptional regulation. Thus, allelic 
differences in specific regions of a DISCI gene can result in differences in the encoded 
protein due to differences in regulation of expression. In particular, some of the identified 
polymorphisms in the human DISCI or DISC2 gene may be associated with differences in 
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the level of transcription, RNA maturation, splicing, or translation of the gene or 
transcription product. 

Analysis of one or more DISC 1 or DISC2 polymorphic region in a subj ect can 
be useful for predicting whether a subject has or is likely to develop aberrant neurological 
5 activities or disorders resulting therefrom, such as neuropsychatric disorders or diseases, 
e.g., schizophrenia, ADD, schizoaffectiove disorder, BAD, unipolar affective disorder, and 
adolescent conduct disorder. 

In preferred embodiments, the methods of the invention can be characterized 
as comprising detecting, in a sample of cells from the subject, the presence or absence of a 

1 0 specific allelic variant of one or more polymorphic regions of a DISC 1 or DISC2 gene. The 
allelic differences can be: (i) a difference in the identity of at least one nucleotide or (ii) a 
difference in the number of nucleotides, which difference can be a single nucleotide or 
several nucleotides. The invention also provides methods for detecting differences in DISC 1 
or DISC2 genes such as chromosomal rearrangements, e.g., chromosomal dislocation. The 

1 5 invention can also be used in prenatal diagnostics. 

A preferred detection method is allele specific hybridization using probes 
overlapping the polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around 
the polymorphic region. Examples of probes for detecting specific allelic variants of a 
polymorphic region located in the DISCI gene are nucleic acid sequences comprising a 

20 nucleotide sequence from any of SEQ ID NOS: 33-43, as set forth in TABLE 5A and 5B, 
infra. For instance, a probe for detecting a specific allelic variant in exon 9 is set forth in 
SEQ ID NO:33; a probe for detecting specific allelic variants of the polymorphic region 
located in intron 10 is set forth in SEQ ID NO:34; a probe for detecting specific allelic 
variants of the polymorphic region located in exon 1 1 is set forth in SEQ ID NO:35; probes 

25 for detecting specific allelic variants of the polymorphic region located in exon 1 3 is set forth 
in any of SEQ ID NOS:36-41; a probe for detecting a specific allelic variant of the 5'- 
promoter is provided in SEQ ID NO: 42; and a probe for detecting a specific allelic variant 
of a polymorphic region located in exon 6 is set forth in SEQ ID NO: 43. In a preferred 
embodiment of the invention, several probes capable of hybridizing specifically to allelic 
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variants are attached to a solid phase support, e.g., a "chip". Oligonucleotides can be bound 
to a solid support by a variety of processes, including lithography. For example a chip can 
hold up to 250,000 oligonucleotides (GeneChip, Affymetrix). Mutation detection analysis 
using these chips comprising oligonucleotides, also termed "DNA probe arrays" is described 
5 e.g., in Cronin et ah (1996) Human Mutation 7:244. In one embodiment, a chip comprises 
all the allelic variants of at least one polymorphic region of a gene. The solid phase support 
is then contacted with a test nucleic acid and hybridization to the specific probes is detected. 
Accordingly, the identity of numerous allelic variants of one or more genes can be identified 
in a simple hybridization experiment. For example, the identity of the allelic variant of the 

10 nucleotide polymorphism in the 5' promoter region can be determined in a single 
hybridization experiment. 

In other detection methods, it is necessary to first amplify at least a portion 
of _the DISCI or DISC2 gene prior to identifying the allelic variant. Amplification can be 
performed, e.g., by PGR and/or LCR (see Wu and Wallace, (1989) Genomics 4:560). 

1 5 according to methods known in the art. In one embodiment, genomic DNA of a cell is 
exposed to two PCR primers and amplification for a number of cycles sufficient to produce 
the required amount of amplified DNA. In preferred embodiments, the primers are located 
between 150 and 350 base pairs apart. Preferred primers, such as primers for amplifying 
each of the exons of the human DISCI or DISC2 gene, are listed in FIG. 4. 

20 Alternative amplification methods include: self sustained sequence replication 

(Guatelli, J.C. et ai, 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. etal, 1989, Proc. Natl. Acad. Sci. USA 86:1 173-1 177), 
Q-Beta Replicase (Lizardi, P.M. et al 9 1988, Bio/Technology 6:1197), and self-sustained 
sequence replication {Guatelli et ai, (1989) Proc. Nat Acad. Sci. 87: 1 874), and nucleic acid 

25 based sequence amplification (NABS A), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those 
of skill in the art. These detection schemes are especially useful for the detection of nucleic 
acid molecules if such molecules are present in very low numbers. 
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In one embodiment, any of a variety of sequencing reactions known in the art 
can be used to directly sequence at least a portion of a DISCI or DISC2 gene and detect 
allelic variants, e.g., mutations, by comparing the sequence of the sample sequence with the 
corresponding wild-type (control) sequence. Exemplary sequencing reactions include those 
5 based on techniques developed by Maxam and Gilbert (Proc. Natl Acad Sci USA (1977) 
74:560) or Sanger (Sanger et al (1977) Proc. Nat. Acad. Sci 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures may be utilized when 
performing the subject assays {Biotechniques (1995) 19:448), including sequencing by mass 
spectrometry (see, for example, U.S. Patent No. 5,547,835 and international patent 

10 application Publication Number WO 94/16101, entitled DNA Sequencing by Mass 
Spectrometry by H. Koster; U.S. Patent No. 5,547,835 and international patent application 
Publication Number WO 94/21822 entitled "DNA Sequencing by Mass Spectrometry Via 
Exonuclease Degradation" by H. Koster), and U.S Patent No. 5,605,798 and International 
Patent Application No. PCT/US96/03651 entitled DNA Diagnostics Based on Mass 

15 Spectrometry by H. Koster;. Cohen et al {\99b)Adv Chromatogr 36:127-162; and Griffin 
et al (1993) Appl Biochem Biotechnol 38:147-159). It will be evident to one skilled in the 
art that, for certain embodiments, the occurrence of only one, two or three of the nucleic acid 
bases need be determined in the sequencing reaction. For instance, A -track or the like, e.g., 
where only one nucleotide is detected, can be carried out 

20 Yet other sequencing methods are disclosed, e.g., in U.S. Patent No. 

5,580,732 entitled "Method of DNA sequencing employing a mixed DNA-polymer chain 
probe" and U.S. Patent No. 5,571,676 entitled "Method for mismatch-directed in vitro DNA 
sequencing". 

In seme cases, the presence of a specific allele of a DISCI or DISC2 gene in 
25 DNA from a subject can be shown by restriction enzyme analysis. For example, a specific 
nucleotide polymorphism can result in a nucleotide sequence comprising a restriction site 
which is absent from the nucleotide sequence of another allelic variant. Thus, 
polymorphisms can be determined by analyzing the products or restriction digests. 
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In a further embodiment, protection from cleavage agents (such as a nuclease, 
hydroxylamine or osmium tetroxide and with piperidine) can be used to detect mismatched 
bases in RNA/RNA DN A/DN A, or RN A/DN A heteroduplexes (Myers, et al (1985) Science 
230:1242). In general, the technique of "mismatch cleavage" starts by providing 
5 heteroduplexes formed by hybridizing a control nucleic acid, which is optionally labeled, 
e.g., RNA or DNA, comprising a nucleotide sequence of a 5-LO allelic variant with a sample 
nucleic acid, e.g, RNA or DNA, obtained from a tissue sample. The double-stranded 
duplexes are treated with an agent which cleaves single- stranded regions of the duplex such 
as duplexes formed based on base-pair mismatches between the control and sample strands. 

1 0 For instance, RNA/DNA duplexes can be treated with RNase and DNA/DN A hybrids treated 
with SI nuclease to enzymatically digest the mismatched regions. In other embodiments, 
either DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium 
tetroxide and with piperidine in order to digest mismatched regions. After digestion of the 
mismatched regions, the resulting material is then separated by size on denaturing 

1 5 polyacrylamide gels to determine whether the control and sample nucleic acids have an 
identical nucleotide sequence or in which nucleotides the> are different. See, for example, 
Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al (1992) Methods 
Enzymod. 217:286-295. In a preferred embodiment, the control or sample nucleic acid is 
labeled for detection. 

20 In another embodiment, an allelic variant can be identified by denaturing 

high-performance liquid chromatography (DHPLC; see Examples, infra) (Oefner and 
Underhill, (1995) Am. J. Human Gen. 57:Suppl. A266). In general, PCR products are 
produced using PCR primers flanking the DNA of interest. DHPLC analysis is carried out 
and the resulting chromatograms are analyzed to identify base pair alterations or deletions 

25 based on specific chromatographic profiles (see O'Donovan et al (1998) Genomics 52:44- 
49). 

In other embodiments, alterations in electrophoretic mobility is used to 
identify the type of DISCI or DISC2 allelic variant. For example, single strand 
conformation polymorphism (SSCP; see Examples, infra) may be used to detect differences 
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in electrophoretic mobility between mutant and wild type nucleic acids (Orita et ah (1989) 
Proc Nath Acad Sci USA 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and 
Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample 
and control nucleic acids are denatured and allowed to renature. The secondary structure of 
5 single- stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
fragments may be labeled or detected with labeled probes. The sensitivity of the assay may 
be enhanced by using RNA (rather than DNA), in which the secondary structure is more 
sensitive to a change in sequence. In another preferred embodiment, the subject method 

10 utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on the 
basis of changes in electrophoretic mobility (Keen et ah (199i) Trends Genet 7:5). 

In yet another embodiment, the identity of an allelic variant of a polymorphic 
region is obtained by analyzing the movement of a nucleic acid comprising the polymorphic 
region in polyacry lamide gels containing a gradient of denatuxant is assayed using denaturing 

15 gradient gel electrophoresis (DGGE) (Myers et al (1985) Nature 313:495). When DGGE 
is used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a GC clamp of approximately 40 bp of high-melting GO 
rich DNA by PCR. In a further embodiment, a lemperature gradient is used in place of a 
denaturing agent gradient to identify differences in the mobility of control and sample DNA 

20 (Rosenbaum and Reissner (1987) Biophys Chem 265 . 1 275). 

Examples of techniques for detecting differences of at least one nucleotide 
between 2 nucleic acids include, but are not limited to, selective oligonucleotide 
hybridization, selective amplification, or selective primer extension. For example, 
oligonucleotide probes may be prepared in which the known polymorphic nucleotide is 

25 placed centrally (allele-specific probes) and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et ah (1986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad Sci USA 86:6230; and Wallace et ah (1979) 
Nuch Acids Res. 6:3543). Such allele specific oligonucleotide hybridization techniques may 
be used for the simultaneous detection of several nucleotide changes in different 
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polymorphic regions of the DISCI or DISC2 gene. For example, oligonucleotides having 
nucleotide sequences of specific allelic variants are attached to a hybridizing membrane and 
this membrane is then hybridized with labeled sample nucleic acid. Analysis of the 
hybridization signal will then reveal the identity of the nucleotides of the sample nucleic 
5 acid. 

Alternatively, allele specific amplification technology which depends on 
selective PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the allelic variant of 
interest in the center of the molecule (so that amplification depends on differential 

10 hybridization) (Gibbsetal(l989)Nucleic Acids Res. 17:2437-2448) or at the extreme 3' end 
of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238; Newton et al (1989) Nucl Acids 
Res. 17:2503). This technique is also termed "PROBE" for Probe Oligo Base Extension. 
In addition it may be desirable to introduce a novel restriction site in the region of the 

15 mutation to create cleavage-based detection (Gasparini et al (1 992) Mol Cell Probes 6:1). 

In another embodiment, identification of the allelic variant is carried out using 
an oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and 
in Landegren, U. et al, (1988) Science 241:1077-1080 The OLA protocol uses two 
oligonucleotides which are designed to be capable of hybridizing to abutting sequences of 

20 a single strand of a target. One of the oligonucleotides is linked to a separation marker, e.g,. 
biotinylated, and the other is detectably labeled. If the precise complementary sequence is 
found in a target molecule, the oligonucleotides will hybridize such that their termini abut, 
and create a ligation substrate. Ligation then permits the labeled oligonucleotide to be 
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al have described a 

25 nucleic acid detection assay that combines attributes of PCR and OLA (Nickerson, D. A. et 
al., (1990) Proc. Natl Acad. Set (U.S.A.) 87:8923-8927. In this method, PCR is used to 
achieve the exponential amplification of target DNA, which is then detected using OLA. 

Several techniques based on this OLA method have been developed and can 
be used to detect specific allelic variants of a polymorphic region of a DISCI or DISC2 gene. 
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For example, U.S. Patent No. 5593826 discloses an OLA using an oligonucleotide having 
3'-amino group and a S'-phosphorylated oligonucleotide to form a conjugate having a 
phosphoramidate linkage. In another variation of OLA described in Tobe et al ((1996) 
Nucleic Acids Res 24: 3728), OLA combined with PCR permits typing of two alleles in a 
5 single microtiter well. By marking each of the allele-specific primers with a unique hapten, 
i.e. digoxigenin and fluorescein, each OLA reaction can be detected by using hapten specific 
antibodies that are labeled with different enzyme reporters, alkaline phosphatase or 
horseradish peroxidase. This system permits the detection of the two alleles using a high 
throughput format that leads to the production of two different colors, 

10 The invention further provides methods for detecting single nucleotide 

polymorphisms (SNPs) in a DISCI or DISC2 gene. Because single nucleotide 
polymorphisms constitute sites of variation flanked by regions of invariant sequence, their 
analysis requires no more than the determination of the identity of the single nucleotide 
present at the site of variation and it is unnecessary to determine a complete gene sequence 

15 for each patient. Several methods have been developed to facilitate the analysis of such 
single nucleotide polymorphisms. 

In one embodiment, the single base polymorphism can be detected by using 
a specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Pat. 
No. 4,656,127). According to the method, a primer complementary to the allelic sequence 

20 immediately 3 ' to the polymorphic site i s permitted to hybridize to a target molecule obtained 
from a particular animal or human. If the polymorphic site on the target molecule contains 
a nucleotide that is complementary to the particular exonuclease-resistant nucleotide 
derivative present, then that derivative will be incorporated onto the end of the hybridized 
primer. Such incorporation renders the primer resistant to exonuclease, and thereby permits 

25 its detection. Since the identity of the exonuclease-resistant derivative of the sample is 
known, a finding that the primer has become resistant to exonucleases reveals that the 
nucleotide present in the polymorphic site of the target molecule was complementary to that 
of the nucleotide derivative used in the reaction. This method has the advantage that it does 
not require the determination of large amounts of extraneous sequence data. 
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In another embodiment of the invention, a solution-based method is used for 
determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al (French 
Patent 2,650,840; PCT Appln. No. WO91/02087). As in the Mundy method of U.S. Pat. No. 
4,656,127, a primer is employed that is complementary to allelic sequences immediately 3' 
5 to a polymorphic site. The method determines the identity of the nucleotide of that site using 
labeled dideoxynucleotide derivatives, which, if complementary to the nucleotide of the 
polymorphic site will become incorporated onto the terminus of the primer. 

An alternative method, known as Genetic Bit Analysis ("GBA") is described 
by Gcelet, P. et al (PCT Appln. No. 92/15712). The method of Goelet, P. et al uses 

1 0 mixtures of labeled terminators and a primer that is complementary to the sequence 3' to a 
polymorphic site. The labeled terminator that is incorporated is thus determined by, and 
complementary to, the nucleotide present in the polymorphic site of the target molecule 
being evaluated. In contrast to the method of Cohen et al (French Patent 2,650,840; PCT 
Appln, No. W09 1/02087) the method of Goelet, P. et al is preferably a heterogeneous phase 

15 assay, in which the primer or the target molecule is immobilized to a solid phase. 

Recently, several primer-guided nucleotide incorporation procedures for 
assaying polymorphic sites in DNA have been described (Komher, J. S. et a/., Nucl. Acids. 
Res. 17:7779-7784 (1989); Sokolov, B. P., NucL Acids Res. 18:3671 (1990); Syvanen, A. 
"C, et al, Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al, Proc. Natl. Acad. Sci. 

20 (U.S.A.) 88:1143-1147 (1991); Prezant, T. R. et al , Hum. Mutat. 1:159-164 (1992); 
Ugozzoli, L. et al , GAT A 9: 1 07- 1 1 2 ( 1 992); Nyren, P. ei al , Anal. Biochem. 208:171-175 
(1993)). These methods differ from GBAO in that they all rely on the incorporation of 
labeled deoxynucleotides to discriminate between bases at a polymorphic site. In such a 
format, since the signal is proportional to the number of deoxynucleotides incorporated, 

25 polymorphisms that occur in runs of the same nucleotide can result in signals that are 
proportional to the length of the run (Syvanen, A. -C, et al , Amer. J. Hum. Genet. 52:46-59 
(1993)). 
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For determining the identity of the allelic variant of a polymorphic region 
located in the coding region of a DISCI or DISC2 gene, yet other methods than those 
described above can be used. For example, identification of an allelic variant which encodes 
a mutated DISC 1 protein can be performed by using an antibody specifically recognizing the 
5 mutant protein in, e.g., immunohistochemistry or immunoprecipitation. Antibodies to wild- 
type DISCI protein or mutated forms of DISCI proteins can be prepared according to 
methods known in the art and are also described herein. Preferred antibodies specifically 
bind to a human DISCI protein comprising any of the amino acid substitutions set forth in 
TABLE 5A. Alternatively, one can also measure an activity of a wild -type or mutant DISCI 

1 0 protein, such binding or agonist/antagonist activity. Such assays are known in the art. For 
example, a ligand to the DISCI protein can be mixed with both wild-type and mutant DISC 1 
protein to evaluate whether ligand binding of the mutant protein differs from ligand binding 
to the wild-type protein. 

Antibodies directed against wild type or mutant DISC 1 polypeptides or allelic 

15 variant thereof, which are discussed above, may < also be used in disease diagnostics and 
prognostics. Such diagnostic methods, may be used to detect abnormalities in the level of 
DISCI polypeptide expression, or abnormalities in the structure and/or tissue, cellular, or 
subcellular location of a DISCI polypeptide. Structural differences may include, for 
example, differences in the size, electronegativity, or antigenicity of the mutant DISCI 

20 polypeptide relative to the wild-type polypeptide. Protein from the tissue or cell type to be 
analyzed may easily be detected or isolated using techniques which are well known to one 
of skill in the art, including but not limited to western blot analysis. For a detailed 
explanation of methods for carrying out Western blot analysis, see Sambrook et al 9 1989, 
supra, at Chapter 1 8. The protein detection and isolation methods employed herein may also 

25 be such as those described in Harlow and Lane, for example, (Harlow, E. and Lane, D., 1988, 
"Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York), which is incorporated herein by reference in its entirety. 

This can be accomplished, for example, by immunofluorescence techniques 
employing a fluorescently labeled antibody (see below) coupled with light microscopic, flow 
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cytometric, or fluorimetric detection. The antibodies (or fragments thereof) useful in the 
present invention may, additionally, be employed histologically, as in immunofluorescence 
or immunoelectron microscopy, for in situ detection of DISCI polypeptides. In situ 
detection may be accomplished by removing a histological specimen from a patient, and 
5 applying thereto a labeled antibody of the present invention. The antibody (or fragment) is 
preferably applied by overlaying the labeled antibody (or fragment) onto a biological sample. 
Through the use of such a procedure, it is possible to determine not only the presence of the 
DISCI polypeptide, but also its distribution in the examined tissue. Using the present 
invention, one of ordinary skill will readily perceive that any of a wide variety of histological 
10 methods (such as staining procedures) can be modified in order to achieve such in situ 
detection. 

Often a solid phase support or carrier is used as a support capable of binding 
an antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 

1 5 polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to 
some extent or insoluble for the purposes of the present invention. The support material may 
have virtually any possible structural configuration so long as the coupled molecule is 
capable of binding to an antigen or antibody. Thus, the support configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external 

20 surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads- Those skilled in the art will know many other 
suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use 
of routine experimentation. 

One means for labeling an anti-DISCl polypeptide specific antibody is via 

25 linkage to an enzyme and use in an enzyme immunoassay (EIA) (Voller, "The Enzyme 
Linked Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1 -7, 1 978, Microbiological 
Associates Quarterly Publication, Walkersville, MD; Voller, et al,J. Clin. Pathol. 31 :507- 
520 (1978); Butler, Meth. Enzymol. 73:482-523 (1981); Maggio, (ed.) Enzyme 
Immunoassay, CRC Press, Boca Raton, FL, 1980; Ishikawa, et al, (eds.) Enzyme 
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Immunoassay, Kgaku Shoin, Tokyo, 1981). The enzyme which is bound to the antibody will 
react with an appropriate substrate, preferably a chromogenic substrate, in such a manner as 
to produce a chemical moiety which can be detected, for example, by spectrophotometry 
fluorimetric or by visual means. Enzymes which can be used to detectably label the antibody 
5 include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5- 
steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, dehydrogenase, 
triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, 
glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate 
dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished 

1 0 by colorimetric methods which employ a chromogenic substrate for the enzyme. Detection 
may also be accomplished by visual comparison of the extent of enzymatic reaction of a 
substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other 
immunoassays. For example, by radioactively labeling the antibodies or antibody fragments, 

] 5 it is possible to detect fingerprint gene wild type or mutant peptides through the use of a 
radioimmunoassay (RIA) (see, for example, Weintraub, B., Principles of 
Radioimmunoassays, Seventh Training Course on Radioligand Assaj Techniques, The 
Endocrine Society, March, 1986, which is incorporated by reference herein). The radioactive 
isotope can be detected by such means as the use of a gamma counter or a scintillation 

20 counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When 
the fluorescently labeled antibody is exposed to light of the proper wave length, its presence 
can then be detected due to fluorescence. Among the most commonly used fluorescent 
labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 

25 allophycocyanin, o-phthaldehyde and fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting 
metals such as l52 Eu, or others of the lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as diethylenetriaminepentacetic acid (DTP A) or 
ethylenediaminetetraacetic acid (EDTA). 
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The antibody also can be detectably labeled by coupling it to a 
chemiluminescent compound. The presence of the chemiluminescent-tagged antibody is 
then determined by detecting the presence of luminescence that arises during the course of 
a chemical reaction. Examples of particularly useful chemiluminescent labeling compounds 
5 are luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate 
ester. 

Likewise, a bioluminescent compound may be used to label the antibody of 
the present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in, which a catalytic protein increases the efficiency of the chemiluminescent 
1 0 reaction. The presence of a bioluminescent protein is determined by detecting the presence 
of luminescence. Important bioluminescent compounds for purposes of labeling are 
luciferin, luciferase and aequorin. 

Moreover, it will be understood that any of the above methods for detecting 
alterations in a gene or gene product or polymorphic variants can be used to monitor the 
1 5 course of treatment or therapy. 

If a polymorphic region is located in an exon, either in a coding or non-coding 
portion of the gene, the identity of the allelic variant can be determined by determining the 
molecular structure of the mRNA, pre-roRNA, or cDNA. The molecular structure can be 
determined using any of the above described methods for determining the molecular 
20 structure of the genomic DNA, e.g., DHPLC, sequencing and SSCP. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits, such as those described above, comprising at least one probe 
or primer nucleic acid described herein, which may be conveniently used, e.g., to determine 
whether a subject has or is at risk of developing a disease associated with a specific DISCI 
25 or DISC2 allelic variant. 

Sample nucleic acid for using in the above-described diagnostic and 
prognostic methods can be obtained from any cell type or tissue of a subject. For example, 
a subject's bodily fluid (e.g. blood) can be obtained by known techniques (e.g. venipuncture). 
Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). Fetal 
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nucleic acid samples can be obtained from maternal blood as described in International 
Patent Application No. W09 1/07660 to Bianchi. Alternatively, amniocytes or chorionic villi 
may be obtained for performing prenatal testing. 

Diagnostic procedures may also be performed in situ directly upon tissue 
5 sections (fixed and/or frozen) of patient tissue obtained from biopsies or resection s, such that 
no nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or 
primers for such in situ procedures (see, for example, Nuovo, G.J., 1992, PCR in situ 
hybridization: protocols and applications, Raven Press, NY). 

In addition to methods which focus primarily on the detection of one nucleic 
1 0 acid sequence, profiles may also be assessed in such detection schemes. Fingerprint profiles 
may be generated, for example, by utilizing a differential display procedure, Northern 
analysis and/or RT-PCR. 

Ph armacogen omics 

1 5 Knowledge of the identity of the allele of one or more DISC 1 or DISC2 gene 

polymorphic regions in an individual (the DISCI or DISC2 genetic profile), alone or in 
conjunction with information on other genetic defects contributing to the same disease (the 
genetic profile of the particular disease) also allows a customization of the therapy for a 
particular disease to the individual's genetic profile, the goal of ''pharmacogenomics". For 

20 example, subjects having a specific allele of a DISC 1 or DISC2 gene may or may not exhibit 
symptoms of a particular disease or be predisposed to developing symptoms of a particular 
disease. Further, if those subjects are symptomatic, they may or may not respond to a 
certain drug, e.g., a specific DISCI or DISC2 therapeutic, such as, e.g., an inhibitor of 
DISCI or DISC2 activity or binding, but may respond to another. Thus, generation of a 

25 DISCI or DISC2 genetic profile, (e.g., categorization of alterations in DISCI or DISC2 
genes which are associated with the development of a particular disease), from a population 
of subjects, who are symptomatic for a disease or condition that is caused by or contributed 
to by a defective and/or deficient DISCI or DISC2 gene and/or protein (a DISCI or DISC2 
genetic population profile) and comparison of an individual's DISC 1 or DISC2 profile to the 
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population profile, permits the selection or design of drugs that are expected to be safe and 
efficacious for a particular patient or patient population (i.e., a group of patients having the 
same genetic alteration). 

For example, a DISCI or DISC2 population profile can be performed by 
5 determining the DISCI or DISC2 profile, e.g., the identity of DISCI or DISC2 alleles, in a 
patient population having a disease, which is associated with one or more specific alleles of 
DISCI or DISC2 polymorphic regions. Optionally, the DISCI or DISC2 population profile 
can further include information relating to the response of the population to a DISCI or 
DISC2 therapeutic, using any of a variety of methods, including, monitoring: 1) the severity 

1 0 of symptoms associated with the DISCI or DISC2 related disease, 2) DISC 1 or DISC2 gene 
expression level, 3) DISCI or DISC2 mRJNA level, 4) DISCI protein level, 5) eosinophil 
level, and/or 6) leukotriene level, and dividing or categorizing the population based on 
particular DISCI or DISC2 alleles. The DISCI or DISC2 genetic population profile can 
also, optionally, indicate those particular DISCI or DISC2 alleles which are present in 

15 patients that are either responsive or non-responsive to a particular therapeutic. This 
information or population profile, is then useful for predicting which individuals should 
respond to particular drugs, based on their individual DISCI or DISC2 profile. 

In a preferred embodiment, the DISCI or DISC2 profile is a transcriptional 
or expression level profile and step (i) is comprised of determining the expression level of 

20 DISCI proteins, alone or in conjunction with the expression level of other genes known to 
contribute to the same disease at various stages of the disease. 

Pharmacogenomic studies can also be performed using transgenic animals. 
For example, one can produce transgenic mice, e.g., as described herein, which contain a 
specific allelic variant of a DISCI or DISC2 gene. These mice can be created, e.g., by 

25 replacing their wild-type DISCI or DISC2 gene with an allele of the human DISCI or 
DISC2 gene. The response of these mice to specific DISCI or DISC2 therapeutics can then 
be determined. 
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Methods o f Treatment 

The present invention provides for both prophylactic and therapeutic methods 
of treating a subject having or likely to develop a disorder associated with specific DISCI 
or DISC2 alleles and/or aberrant DISCI expression or activity, e.g., disorders or diseases 
5 associated with aberrant neurological functions, such as neuropsychiatric diseases or 
disorders. 

The DISCI or DISC2 nucleic acid molecules, polypeptides and antibodies of 
the present invention may be used, for example, in therapeutic methods to treat disorders, 
such as neuropsychiatric disorder (including, for example, schizophrenia, schizoaffective 

1 0 disorder, bipolar affective disorder, unipolar affective disorder, attention deficit disorder, and 
adolescent conduct disorder). In addition, compounds that bind to a DISCI or DISC2 
nucleic acid or polypeptide of the invention, compounds that modulate DISCI or DISC2 
gene expression, and compounds that interfere with or modulate binding of a DISCI or 
DISC2 nucleic acid or polypeptide with a binding compound may be useful, e.g. , in methods 

1 5 for treating such neuropsychiatric disorders. 

For example, in a preferred embodiment, compounds that specifically bind 
to variant DISC 1 or DISC2 nucleic acid of the present invention or, alternative!)' , compounds 
that specifically bind to a variant DISCI or DISC2 gene product encoded by such a nucleic 
acid molecule may be used to inhibit the expression or activity of that variant DISCI or 

20 DISC2 gene or gene product, while not inhibiting the expression or activity of a wild-type 
DISCI or DISC2 gene or its gene product. 

Prophylactic Methods. In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition associated with a specific DISCI or DISC2 
25 allele and/or an aberrant DISC 1 or DISC2 expression or activity, such as a neuropsychiatric 
disorder, e.g., schizophrenia, and medical conditions resulting therefrom, by administering 
to the subject an agent which counteracts the unfavorable biological effect of the specific 
DISC 1 or DISC2 allele. Subjects at risk for such a disease can be identified by a diagnostic 
or prognostic assay, e.g., as described herein. Administration of a prophylactic agent can 
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occur prior to the manifestation of symptoms associated with specific DISCI or DISC2 
alleles, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the identity of the DISCI or DISC2 allele in a subject, a 
compound that counteracts the effect of this allele is administered. The compound can be 
5 a compound modulating the activity of a DISCI polypeptide or DISC2 RNA, e.g., an 
inhibitor. The treatment can also be a specific diet, or environmental alteration. In 
particular, the treatment can be undertaken prophylactically, before any other symptoms are 
present. Such a prophylactic treatment could thus prevent the development of an aberrant 
neurological function or aberrant neuropsychiatric profile such as those displayed in , e.g., 
10 schizophrenia, schizoaffective disorder, bipolar disorder, unipolar affective disorder and 
adolescent conduct disorder. The prophylactic methods are similar to therapeutic methods 
of the present invention and are further discussed in the following subsections. 

Therapeutic Methods. The invention further provides methods of treating 
15 subjects having a disease or disorder associated with a specific allelic variant of a 
polymorphic region of a DISCI or DISC2 gene Preferred diseases or disorders include 
those associated with aberrant neurological function, and disorders resulting therefrom (e g. , 
neuropsychiatric diseases and disorders, such as, for example, schizophrenia, schizoaffective 
disorder, bipolar disorder, unipolar affective disorder and adolescent conduct disorder). 
20 In one embodiment, the method comprises (a) determining the identity of the 

allelic variant; and (b) administering to the subject a compound that compensates for the 
effect of the specific allelic variant. The polymorphic region can be localized at any location 
of the gene, e.g., in the promoter (e.g., in a regulatory element of the promoter), in an exon, 
(e.g., coding region of an exon), in an intron, or at an exon/intron border. Thus, depending 
25 on the site of the polymorphism in the DISCI or DISC2 gene, a subject having a specific 
variant of the polymorphic region which is associated with a specific disease or condition, 
can be treated with compounds which specifically compensate for the allelic variant. 

In a preferred embodiment, the identity of one or more of the nucleotides of 
a DISCI gene identified in TABLE 5 can be determined. 
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In a particularly preferred embodiment, it is determined that a subject has one 
or more of the following (WT/SNP): C/T at position 1 872; and ATT at position 2 1 63 ; of SEQ 
ID NO: 1, respectively. 

If a subject has one or more of the polymorphisms of the invention (see 
5 TABLE 5), that subject can have or be predicted to be at risk for developing a 
neuropsychatric disorder, e.g. schizophrenia. The neuropsychiatric disorder can be prevented 
from occurring or can be reduced by administering to the subject a pharmaceutically 
effective amount of a compound found to inhibit the activity or binding of the DISCI 
polypeptide or DISC2 gene product, or modifies the transcription or expression of the DISC 1 

10 orDISC2gene. 

Generally, the allelic variant can be a mutant allele, /. e., an allele which when 
present in one, or preferably two copies, in a subject results in a change in the phenotype of 
the subject. A mutation can be a substitution, deletion, and/or addition of at least one 
nucleotide relative to the wild-type allele (/. e., the reference sequence). Depending on where 

1 5 the mutation is located in the DISC 1 or DISC2 gene, the subject can be treated to specifically 
compensate for the mutation. For example, if the mutation is present in the coding region 
of the DISCI gene and results in a mGre active DISCI protein, the subject can be treated, 
e.g., by admini station to the subject of a DISCI inhibitor, such that the administration of an 
.inhibitor prevents aberrant neurological function associated with the DISCI protein. In 

20 addition, wild-type DISCI protein or nucleic acid coding sequence/cDNA can be 
administered to compensate for the endogenous mutated form of the DISCI protein. 
Likewise, an inhibitor or compensator can be administered to counteract any abberant 
activity of a DISC2 RNA. Nucleic acids encoding wild-type human DISCI protein are set 
forth in SEQ ID NO:2. Furthermore, depending on the site of the mutation in the DISCI 

25 protein and the specific effect on its activity, specific treatments can be designed to 
compensate for that effect. 

Yet in another embodiment, the invention provides methods for treating a 
subject having a mutated DISCI or DISC2 gene, in which the mutation is located in a 
regulatory region of the gene. Such a regulatory region can be localized in the promoter of 
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the gene, in the 5' or 3' untranslated region of an exon, or in an intron. A mutation in a 
regulatory region can result in increased production of DISCI protein or DISC2 RNA, 
decreased production of DISCI protein or DISC2 RNA, or production of DISCI protein or 
DISC2 RNA having an aberrant tissue distribution. The effect of a mutation in a regulatory 
5 region upon the DISCI protein or DISC2 RNA can be determined, e.g., by measuring the 
protein level or mRNA level in cells having DISCI and DISC2 genes with such mutations 
and which, normally (Le. f in the absence of the mutation) produce DISCI protein or DISC2 
RNA. The effect of a mutation can also be determined in vitro. For example, if the mutation 
is in the promoter, a reporter construct can be constructed which comprises the mutated 

1 0 promoter linked to a reporter gene, the construct transfected into cells, and comparison of the 
level of expression of the reporter gene under the control of the mutated promoter and under 
the control of a wild-type promoter. Such experiments can also be carried out in mice 
transgenic for the mutated promoter. If the mutation is located in an intron, the effect of the 
mutation can be determined, e.g., by producing transgenic animals in which the mutated 

1 5 DISC 1 or DISC2 gene has been introduced and in which the wild-type gene may have been 
knocked out, Comparison of the level of expression/transcription of DISC 1 or DISC2 in the 
mice transgenic for the mutant human DISCI or DISC2 gene with mice transgenic foi a 
wild-type human DISCI or DISC2 gene will reveal whether the mutation results in 
increased, decreased synthesis of the corresponding protein and/or aberrant tissue distribution 

20 of the protein or RNA. Such analysis could also be performed in cultured cells, in which the 
human mutant DISCI or DISC2 gene is introduced and, e.g., replaces the endogenous wild- 
type gene in the cell. Thus, depending on the effect of the mutation in a regulatory region 
of a DISC 1 or DISC2 gene, a specific treatment can be administered to a subject having such 
a mutation. Accordingly, if the mutation results in increased DISCI protein or DISC2 RNA 

25 levels, the subject can be treated by administration of a compound which reduces 
DISC1/DISC2 expression, e.g., by reducing gene expression or translation or a compound 
which inhibits or reduces the activity of the DISCI protein. 
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Furthermore, it is likely that subjects having different allelic variants of a 
DISCI or DISC2 polymorphic region will respond differently to therapeutic drugs to treat 
diseases or conditions, such as those associated with neuropsychiatric disorders. 

A correlation between drug responses and specific alleles of DISC 1 or DISC2 
5 can be shown, for example, by clinical studies wherein the response to specific drugs of 
subjects having different allelic variants of a polymorphic region of a DISC 1 or DISC2 gene 
is compared. Such studies can also be performed using animal models, such as mice having 
various alleles of human DISCI or DISC2 genes and in which, e.g., the endogenous DISCI 
or DISC2 gene has been inactivated such as by a knock-out mutation. Test drugs are then 

10 administered to the mice having different human DISCI or DISC2 alleles and the response 
of the different mice to a specific compound is compared. Accordingly, the invention 
provides assays for identifying the drug which will be best suited for treating a specific 
disease or condition in a subject. For example, it will be possible to select drugs which will 
be devoid of toxicity, or have the lowest level of toxicity possible for treating a subject 

1 5 having a disease or condition. 



Monitoring Clinical Therapies. The ability to target populations expected 
to show the highest clinical benefit, based on the neurological activity or disease genetic 
profile, can enable: 1) the repositioning of marketed drugs with disappointing market results; 

20 2) the rescue of drug candidates whose clinical development has been discontinued as a 
result of safety or efficacy limitations, which are patient subgroup-specific; and 3) an 
accelerated and less costly development for drug candidates and more optimal drug labeling 
(e.g., since the use of DISCI or DISC2 as a marker is useful for optimizing effective dose). 

In situations in which the disease associated with a specific DISCI or DISC2 

25 allele is characterized by an abnormal protein or RNA expression, the treatment of an 
individual with a DISCI or DISC2 therapeutic can be monitored by determining DISCI or 
DISC2 characteristics, such as DISCI protein level or activity, DISCI or DISC2 mRNA 
levels, and/or transcriptional levels. This measurement will indicate whether the treatment 
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is effective or whether it should be adjusted or optimized. Thus, DISCI or DISC2 can be 
used as a marker for the efficacy of a drug during clinical trials. 

In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent {e.g., an agonist, 
5 antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) comprising the steps of (i) 
obtaining a preadministration sample from a subject prior to administration of the agent; (ii) 
detecting the level of expression of a DISCI protein, DISCI or DISC2 mRNA, or genomic 
DNA in the preadministration sample; (iii) obtaining one or more post-administration 

10 samples from the subject; (iv) detecting the level of expression or activity of the DISCI 
protein, DISCI or DISC2 RNA, or genomic DNA in the post-administration samples; (v) 
comparing the level of expression or activity of the DISC 1 protein, DISC 1/DISC2 RNA, or 
genomic DNA in the preadministration sample with the DISCI protein, DISCI or DISC2 
mRNA, or genomic DNA in the post administration sample or samples; and (vi) altering the 

1 5 administration of the agent to the subject accordingly. For example, increased administration 
of the agent may be desirable to increase the expression or activity of DISCI or DISC2 to 
higher levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, 
decreased administration of the agent may be desirable to decrease expression or activity of 
DISCI or DISC2 to lower levels than detected, i.e., to decrease the effectiveness of the agent. 

20 Cells of a subject may also be obtained before and after administration of a 

DISCI or DISC2 therapeutic to detect the level of expression of genes other than the DISCI 
or DISC2 gene, to verify that the therapeutic does not increase or decrease the expression of 
genes which could be deleterious. This can be done, e.g., by using the method of 
transcriptional profiling. Thus, mRNA from cells exposed in vivo to a DISCI or DISC2 

25 therapeutic and mRNA from the same type of cells that were not exposed to the therapeutic 
could be reverse transcribed and hybridized to a chip containing DNA from numerous genes, 
to thereby compare the expression of genes in cells treated and not treated with a DISC 1 or 
DISC2 therapeutic. If, for example a DISCI or DISC2 therapeutic turns on the expression 
of a proto-oncogene in an individual, use of this particular therapeutic may be undesirable. 
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Other Uses 

The identification of different alleles of DISCI or DISC2 can also be useful 
for identifying an individual among other individuals from the same species. For example, 
DNA sequences can be used as a fingerprint for detection of different individuals within the 
5 same species (Thompson, J. S. and Thompson, eds., Genetics in Medicine, WB Saunders 
Co., Philadelphia, PA (1 991)). This is useful, for example, in forensic studies and paternity 
testing, as described below. 

Forensics Applications. Determination of which specific allele occupies a 

1 0 set of one or more polymorphic sites in an individual identifies a set of polymorphic forms 
that distinguish the individual from others in the population. See generally National 
Research Council, The Evaluation of Forensic DNA Evidence (Eds. Pollard et al , National 
Academy Press, DC, 1996). The more polymorphic sites that are analyzed, the lower the 
probability that the set of polymorphic forms in one individual is the same as that in an 

1 5 unrelated individual. Preferably, if multiple sites are analyzed, the sites are unlinked. Thus, 
the polymorphisms of the invention can be used in conjunction with known polymorphisms 
in distal genes. Preferred polymorphisms for use in forensics are biallelic because the 
population frequencies of two polymorphic forms can usually be determined with greater 
accuracy than those of multiple polymorphic forms at multi-allelic loci. 

20 The capacity to identify a distinguishing or unique set of forensic markers in 

an individual is useful for forensic analysis. For example, one can determine whether a 
blood sample from a suspect matches a blood or other tissue sample from a crime scene by 
determining whether the set of polymorphic forms occupying selected polymorphic sites is 
the same in the suspect and the sample. If the set of polymorphic markers does not match 

25 between a suspect and a sample, it can be concluded (barring experimental error) that the 
suspect was not the source of the sample. If the set of markers is the same in the sample as 
in the suspect, one can conclude that the DNA from the suspect is consistent with that found 
at the crime scene. If frequencies of the polymorphic forms at the loci tested have been 
determined (e.g., by analysis of a suitable population of individuals), one can perform a 
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statistical analysis to determine the probability that a match of suspect and crime scene 
sample would occur by chance. 

p(ID) is the probability that two random individuals have the same 
polymorphic or allelic form at a given polymorphic site. For example, in biallelic loci, four 
genotypes are possible: AA, AB, B A, and BB. If alleles A and B occur in a haploid genome 
of the organism with frequencies x and y, the probability of each genotype in a diploid 
organism is (see WO 95/12607): 

Homozygote: p(AA)= x 2 

Homozygote: p(BB)= y 2 = (1-x) 2 

Single Heterozygote: p(AB)= p(BA)= xy = x(l-x) 

Both Heterozygotes: p(AB+BA)= 2xy = 2x(l-x) 

The probability of identity at one locus (Le., the probability that two 
individuals, picked at random from a population will have identical polymorphic forms at 
a given locus) is given by the equation: p(ID) = (x 2 ). 

These calculations can be extended for any number of polymorphic forms at 
a given locus. For example, the probability of identity p(ID) for a 3 -allele system where the 
alleles have the frequencies in the population of x, y, and z, respectively, is equal to the sum 
of the squares of the genotype frequencies: P(ID) = x 4 + (2xy) 2 + (2yz) 2 + (2xz) 2 + z 4 + y 4 . 

In a locus of n alleles, the appropriate binomial expansion is used to calculate 
p(ID) and p(exc). 

The cumulative probability of identity (cum p(ID)) for each of multiple 
unlinked loci is determined by multiplying the probabilities provided by each locus: 
cum p(ID) = p(IDl)p(ID2)p(ID3)...p(IDn). 

The cumulative probability of non-identity for n loci (i. e., the probability that 
two random individuals will be difference at 1 or more loci) is given by the equation: 
cum p(nonlD) = 1 -cum p(ID). 

If several polymorphic loci are tested, the cumulative probability of non- 
identity for random individuals becomes very high (e.g., one billion to one). Such 
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probabilities can be taken into account together with other evidence in determining the guilt 
or innocence of the suspect. 

Paternity Testing. The object of paternity testing is usually to determine 
5 whether a male is the father of a child. In most cases, the mother of the child is known, and 
thus, it is possible to trace the mother's contribution to the child's genotype. Paternity 
testing investigates whether the part of the child's genotype not attributable to the mother is 
consistent to that of the puntative father. Paternity testing can be performed by analyzing 
sets of polymorphisms in the putative father and in the child. 

10 If the set of polymorphisms in the child attributable to the father does not 

match the set of polymorphisms of the putative father, it can be concluded, barring 
experimental error, that that putative father is not the real father. If the set of polymorphisms 
in the child attributable to the father does match the set of polymorphisms of the putative 
father, a statistical calculation can be performed to determine the probability of a 

1 5 coincidental match. 

The probability of parentage exclusion (representing the probability that a 
random male will have a polymorphic form at a given polymorphic site that makes him 
incompatible as the father ) is given by the equation (see WO 95/12607): p(exc) = xy(l-xy), 
where x and y are the population frequencies of alleles A and B of a biallelic polymorphic 

20 site. 

(At a triallelic site p(exc) = xy(l-xy) + yz(l-yz) + xz(l-xz) + 3xyz(l-xyz)), 
where x, y, and z and the respective populations frequencies of alleles A, B, and C). 

The probability of non-exclusion is: p(non-exc) = l-p(exc). 

The cumulative probability of non-exclusion (representing the values obtained 
25 when n loci are is used) is thus: 

Cum p(non-exc) = p(non-excl)p(non-exc2)p(non-exc3)...p(non-excn). 

The cumulative probability of the exclusion for n loci (representing the 
probability that a random male will be excluded: cum p(exc) = 1 - cum p(non-exc). 
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If several polymorphic loci are included in the analysis, the cumulative 
probability of exclusion of a random male is very high. This probability can be taken into 
account in assessing the liability of a putative father whose polymorphic marker set matches 
the child's polymorphic marker set attributable to his or her father. 

5 

Kits. As set forth herein, the invention provides methods, e.g., diagnostic and 
therapeutic methods, e.g., for determining the type of allelic variant of a polymorphic region 
present in a DISCI or DISC2 gene, such as a human DISCI or DISC2 gene. In preferred 
embodiments, the methods use probes or primers comprising nucleotide sequences which 

10 are complementary to a polymorphic region of a DISCI or DISC2 gene (e.g., SEQ ID 
NOS:33-43). Accordingly, the invention provides kits for performing these methods. 

In a preferred embodiment, the invention provides a kit for determining 
whether a subject has or is at risk of developing a disease or condition associated with a 
specific allelic variant of a DISCI or DISC2 polymorphic region. In an even more preferred 

15 embodiment, the disease or disorder is characterized by an abnormal DISCI or DISC2 
activity. In an even more preferred embodiment, the invention provides a kit for determining 
whether a subject has or is at risk of developing a neuropsychiatric disease such as, e.g., 
schizophrenia, schizoaffective disorder, bipolar disorder, unipolar affective disorder and 
adolescent conduct disorder. 

20 A preferred kit provides reagents for determining whether a subject is likely 

to develop a neuropsychiatric disease such as, e.g., one of the aforementioned 
disorders/diseases. 

Preferred kits comprise at least one probe or primer which is capable of 
specifically hybridizing under stringent conditions to a DISCI or DISC2 sequence or 
25 polymorphic region and instructions for use. The kits preferably comprise at least one of the 
above described nucleic acids. Preferred kits for amplifying at least a portion of a DISCI 
or DISC2 gene, e.g., the 5' promoter region, comprise two primers, at least one of which is 
capable of hybridizing to an allelic variant sequence. Even more preferred kits comprise a 
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pair of primers selected from the group set forth in TABLE 5 and FIG. 4 (SEQ ID NOS :3 3- 1 27). 
The kits of the invention can also comprise one or more control nucleic acids or reference 
nucleic acids, such as nucleic acids comprising a DISCI or DISC2 intronic sequence. For 
example, a kit can comprise primers for amplifying a polymorphic region of a DISCI or 
5 DISC2 gene and a control DNA corresponding to such an amplified DNA and having the 
nucleotide sequence of a specific allelic variant. Thus, direct comparison can be performed 
between the DNA amplified from a subject and the DNA having the nucleotide sequence of 
a specific allelic variant. In one embodiment, the control nucleic acid comprises at least a 
portion of a DISCI or DISC2 gene of an individual who does not have a neuropsychiatric 

10 disease, aberrant neurological activity, or a disease or disorder associated with an aberrant 
neurological activity. 

Yet other kits of the invention comprise at least one reagent necessary to 
perform the assay. For example, the kit can comprise an enzyme. Alternatively the kit can 
comprise a buffer or any other necessary reagent. 

1 5 The present invention is further illustrated by the following examples which 

should not be construed as limiting in any way. The contents of all cited references 
(including, without limitation, literature references, issued patents, published patent 
applications) as cited throughout this application are hereby expressly incorporated by 
reference. The practice of the present invention will employ, unless otherwise indicated, 

20 conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, 
microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See, for example, Molecular Cloning A 
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); 

25 Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al U.S. Patent No: 4,683,195; 
Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 
Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. 
Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. 
Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In 
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Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. 
H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al eds.), Immunochemical Methods In Cell And 
Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1 987); Handbook Of 
5 Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); 
Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). 



6. EXAMPLES 

1 0 The invention is also described by means of particular examples. However, 

the use of such examples anywhere in the specification is illustrative only and in no way 
limits the scope and meaning of the invention or of any exemplified term. Likewise, the 
invention is not limited to any particular preferred embodiments described herein. Indeed, 
many modifications and variations of the invention will be apparent to those skilled in the 

15 art upon reading this specification and can be made without departing from its spirit and 
scope. The invention is therefore to be limited only by the terms of the appended claims 
along with the full scope of equivalents to which the claims are entitled. 



EXAMPLE 1: LOP Analysis of a Region on Human Chromosome 1 Associated 
20 With Neuropsychiatric Disorders 

This Example describes the identification of a susceptibility locus for 

schizophrenia and related disorders resides on chromosome lq32-42, in the region of DISC 1 

and DISC2. 

Evidence for a neuropsychiatric disorder susceptibility locus within the 
25 chromosomal region of DISCI and DISC2 was derived from a linkage analysis of 
schizophrenia family data. An affected sibling pair linkage analysis was aimed to identify 
alleles (at polymorphic DNA markers) which were shared by affected siblings within a 
family (Kruglyak etal.,AmJHum Genet 1996;58:1347-1363). If increased allele sharing 
was identified across many families, and occurred more often than expected by chance, an 
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inference could be made that a disease susceptibility locus also mapped to the region of 
increased allele sharing. 

Materials and Methods 
5 Sample population. Nuclear families, with at least one sibling diagnosed 

with schizophrenia or schizoaffective disorder, and a second sibling with schizophrenia, 
schizoaffective disorders or a related psychotic disorder, were identified from nationwide 
registers in Finland (Hovattaefa/., Am J Hum Genet 1999;65:1114-24; Ekelund et al, Hum 
Mol Genet 2000;12:1049-1057). To facilitate a search for susceptibility loci underlying a 
10 narrow, intermediate or broad definition of schizophrenia, families were grouped into three 
classes. Specifically, prior to genetic analysis, each family was characterized as belonging 
to one, two, or three of three increasingly more inclusive diagnostic groups. These groups, 
as well as the number of families meeting the criteria for each class, are detailed in TABLE 
1. 

15 TABLE 1 

Families for Linkage Analysis of Schizophrenia 
Type of Family No. of 

Families 

1 : At least two siblings with schizophrenia 130 
2 : At least two siblings with schizophrenia or schizoaffective 164 
20 disorder 

3: At least two siblings with schizophrenia, schizoaffective disorder, 175 
schizophreniform disorder, delusional disorder, brief psychotic 
disorder or psychoses not otherwise specified (NOS) 

2 5 Genotyping. Genomic DN A samples were extracted from whole blood using 

standard methods (Blin and Stafford, Nucleic Acids Res 1976;3:2303-8). DNAs were 
quantitated by Hoechst dye fluorescence using a CytoFluor 3000 fluorimeter (PerSeptive 
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Biosystems, Framingham, MA), normalized at 40 ng/|il, and stored at 4°C. Microsatellite 
marker loci, that map to introns of DISCI, were selected from public databases. PCR 
primers, labeled for analysis on the ABI 3 77XL DNA sequencer (Applied Biosy stems, Foster 
City, CA), were prepared using standard oligonucleotide synthesis chemistry. Each genotype 
5 was double-scored, once by an expert human technician and once by a proprietary software 
package; incongruities between the two were resolved by the human scorer. Finished marker 
data for each pedigree were checked for Mendelian inheritance; raw data for all observed 
deviations were reevaluated. 



1 0 Linkage Analysis. Nonparametric affected sibling pair linkage analysis was 

carried out using Genehunter vl.O (Kruglyak et al 1996, supra) as modified by Kong and 
Cox (Am J Hum Genet 1997;61: 1179-1188). Allele frequencies were determined using 
actual or imputed founder genotypes. Evidence for linkage was determined from the LOD 
score and its associated p- value. The LOD score is calculated as the logarithm (base 10) of 

1 5 the odds of observing the data given linkage of the microsattellite marker to a susceptibility 
locus versus the odds of observing the data under the null hypothesis of no linkage. Thus, 
a LOD score of 2 is interpreted as 100: 1 odds in favor of linkage. 



Results 

20 Microsattellite markers D 1 S25 1 , D 1 S3462, and D 1 S2702, mapping to DISC 1 

introns 1, 8, and 9, were identified from public resources. The markers were genotyped in 
the schizophrenia family study DNA and analyzed for linkage to a neuropsychiatric disorders 
within the family sets defined in TABLE 1. Single marker LOD scores, as well as their 
corresponding p-values, are presented in TABLE 2. While these LOD scores do not meet 

25 criteria for genome wide significance (due to the adjustment to the p-value that must be made 
given the multiple tests performed across the genome), the LOD scores for D1S3462 and 
D1S2709 do reach significance for assessing a single candidate region (Lander and 
Kruglyak, Nat Genet 1996;11:241-7). Thus, this region is judged to be linked to 
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neuropsychiatric disorders and the strength of this linkage is greatest for the broad (Type 3) 
definition of schizophrenia. 

TABLE 2 

5 Single Marker LOD Scores for Markers Mapping to DISCI 

Marker LOD Scores 

Family Type 1 Family Type 2 Family Type 3 

D1S251 0.29 0.21 0.16 

D1S3462 1.67* 1.39* 1.31* 

10 D1S2709 1.88* 2.69** 2.88** 

* pO.OOl 

** p<0.005 

This Example thereby shows that the genomic sequences comprising DISCI 
and DISC2 are related to neuropsychiatric disorders (e.g., schizophrenia). The novel SNPs 
15 disclosed in this application are thus ones that can be used to diagnose and treat such 
disorders in a general population, and not just in a special pedigree. 

EXAMPLE 2: Determining the Genomic Structure for DISCI and DISC2 

So that the DISCI and DISC2 genes could be further analyzed for nucleotide 
20 variants and, in particular, for nucleotide variants which correlate with a neuropsychiatric 
disorder such as schizophrenia, schizoaffective disorder, bipolar affective disorder, unipolar 
affective disorder and adolescent conduct disorder, the genomic structure of these genes was 
first determined. Specifically, although the cDNA sequences of both the DISCI and the 
DISC2 genes have been previously described (see, in particular, Millar et aL, Human 
25 Molecular Genetics 2000, 9:1415-1423) their genomic sequences remain unknown, 
including their intron sequences, the intron-exon boundaries and 5'-promoter sequences of 
the DISCI and DISC2 genes. This example describes the elucidation of the genomic 
sequences and structures for these genes. 
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Briefly, the genomic structures of DISCI and DISC2 are determined by 
comparing the cDNA sequences shown in FIGS. 1 A-C (SEQ ID NO: 1 ) and in FIGS. 3A-G 
(SEQ ID NO:3), respectively for each of these genes to the sequences of large, genomic 
DNA clones (BACs) that contain genomic sequences for segments of human chromosome 
5 1 which contain DISCI and DISC2. Such comparisons were done using the Basic Local 
Alignment Search Tool (BLAST; see, for example, Karlin&Altschul, Proc. Natl. Acad. 
U.S.A. 1990,87:2267-2268; Altschul et al 9 J. Mol. Biol 1990,215:403-410; Altschul et 
al 9 Nature Genetics 1993,3:266-272; and Altschul et al. , Nucl Acids. Res. 1997,25:3389- 
3402) and Sequencher 3. LI (Gene Codes Corporation, Ann Arbor, Michigan). The DISCI 

10 and DISC2 cDNA sequences were compared to the sequences of BACs which are publicly 
available from the GenBank database, and identified in the following Table by their 
GenBank Accession number (including the appropriate version number) and their Genelnfo 
Identification (GI) number. The DISCI and DISC2 sequence was also compared to a BAC 
referred to herein as zlhXdisc43 or "disc43". The nucleotide sequence for this BAC is 

1 5 provided here in SEQ ID NO:4. 



TABLE 3 

BACs Used to Evaluate the Genomic Structure of DISCI and DISC2 



BAC 


GenBank 




Clone 


Accession No. 


GI No./ID 


RP11-17H4 


AC01 1655.4 


7239563 


RP1 1-9801 


AL359543.4 


8979569 


RP4-584N17 


AL136171.10 


9188229 


RP5-865N13 


AL353052.2 


9213047 


RP4-730B13 


AL161743.14 


9588180 


RP4-730B13 


AL161743.15 


9717048 



Using BLAST and Sequencher to align the DISCI and DISC2 cDNA 
sequences with the above BAC sequences, regions of identity were identified between the 
transcribed cDNA sequence and the sequences of the genomic region from which they were 
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transcribed. These regions of sequence identity correspond to exon regions of the DISC 1 and 
DISC2 genomic sequences, respectively. The boundaries of the exon regions were 
confirmed by the presence of known, conserved splice elements flanking these exon 
sequences. 

5 

TABLE 4 



Exon/Intron Boundaries of Partial DISCI Genomic Sequences 



Exon 


GINo. 


Residues 1 


Boundary Sequences 


SEQ ID NO: 


1 


7239563 


133392- 


GGAAGGAGCA ... 


5 and 6 






133273 


CACCGCGCAGGTAGGGGAGC 




2 


8979569 


26151- 


TTCTTCCCAGGCAGCCGGGA... 


7 and 8 






25168 


ATGGAGGTCAGTGTCTCTTC 




3 


8979569 


18020- 


ACCAACATAGGTAATATCCT... 


9 and 10 






17951 


TATGATAAAGGTGAGTTTTA 




4 


8979569 


73785- 


GGGTTTCCAGCTGAGACGTT... 


11 and 12 






73635 


CCACTCAGCAGAGAGTACTT 




5 


8979569 


135940- 


TTGTTTTAAGGGCCAGCGGA... 


13 and 14 






136069 


GCAGCTACAGGTGAGCAGGT 




6 


disc43- 


168- 


TTCTCTACAGAAAGAAATCG... 


15 and 16 




8979569 


93742 


CCATAAGGAGGTACTGCTGA 




7 


9188229 


135913- 


ATTCTTCCAGCCTCCAGGAA... 


17 and 18 






135859 


CACTACTAAGGTAAGTACCT 




8 


9188229 


130963- 


CTCCCCCTAGGTGTGTATGA... 


19 and 20 






130861 


GCCATATCAGGTAACTGGCA 




9 


9188229 


1 12742- 


CGTGCTGTAGGAAACCATTT... 


21 and 22 






112554 


ACTGCCTATGGTAGGTAGTG 
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Exon 


GINo. 


Residues - 


Boundary Sequences 


SEQ ID NO: 


10 


9588180 


36876- 


TTTCCCCCAGAAACAAGTGT... 


23 and 24 






36936 


AACTGTGCAGGTAAGGATAA 




11 


9588180 


65546- 


TCTGTCTCAGCTGCAAGTGT... 


25 and 26 






65744 


CCCTTTGAAGGTATTGGAAG 




11' 


9588180 


65546- 


TCTGTCTCAGCTGCAAGTGT... 


27 and 28 






65810 


ACAGAAAGAGGTCTGTCCTT 




12 


9588180 


83196- 


CTCTCGCCAGGAATCTTACA... 


29 and 30 2 






83315 


TCTCATTCATATCCTTTTCA 




13 


9213047 


974-5398 


GTGCCCACAGTCTCTCAGGA... 


31 and 32 








TATCCAAGGC 





- Note non-conserved splice donor sites in introns 4 and 12. 



- There is a discrepancy in the sequence between GI No. 95881 80 (genomic DNA clone; see 
TABLE 3) and 8 1 63868 (cDNA clone; SEQ ID NO: 1 ) at residue 83315. A "T" at that locus 
(residue No. 10 of SEQ ID NO: 30) was confirmed in all samples while the cDNA clone has 
10 a"G". 

Regions of sequence contained in the BAC sequences but not in the cDNA 
sequences are considered to be intron sequences (for those sequences which are flanked by 
exon sequences) of the DISCI or DISC2 genes, or are sequences which lie outside the 
1 5 transcribed 5 f and 3' boundaries of these genes, including a region (referred to here as the 5'- 
promoter region) of about 1 kb upstream of the transcription start site. 

EXAMPLE 3: Detection and Identification of Novel DISCI and DISC2 

Sequence Variations Associated With Neuropsychiatric Disorders 

20 This example describes experiments in which genetic sequences from 

individuals of selected populations, described supra in Example 1 , were analyzed and novel 

DISCI and/or DISC2 polymorphisms were identified. These DISCI and DISC2 variants, 
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as well as the altered gene products they encode, are described here for the first time, and 
represent novel DISCI and DISC2 sequences that can be used in the methods described 
supra, e.g., to diagnose and treat neuropsychiatric disorders, such as schizophrenia, 
schizoaffective disorder, bipolar affective disorder, unipolar affective disorder, adolescent 
5 conduct disorder, etc., in the general population. 



Materials and Methods 
PCR Amplification. DISCI and DISC2 genomic sequences were amplified 
10 according to standard PCR protocols described supra, using oligonucleotide primers 
described below and in FIG. 4. 



Single Stranded Conformation Polymorphism (SSCP) Analysis. SSCP 
analysis of genetic samples was carried out according to standard protocols described 
1 5 previously (see, in particular, Orita et al , Proc. Natl Acad. Sci USA. 1 989, 86:2766-2770) 
and using polyacrylamide gels that contain either 0% or 10% glycerol. The resolution 
achieved in a gel analysis of a specific polymorphism can vary somewhat depending on the 
glycerol concentration in the gel. Thus, in this screening analysis, both 0% and 1 0% glycerol 
gels were used. 

20 

Denaturing High Performance Liquid Chromatography (DHPLC). 
Standard DHPLC (Transgenomic, Inc., Omaha Nebraska) protocols previously described by 
Oefener & Underhill (Am. J. Hum. Genet. 1995, 57:A266) were used to analyze PCR 
amplified fragments of DISCI and DISC2 genomic sequences, with modifications to 
25 increase throughput. Specifically, genetic samples were pooled, after PCR amplification, by 
combining two DNA samples per well in a 96 well plate, giving 48 pooled samples. 

WAVEmaker software (Transgenomic, Inc.) was used to predict a melting 
temperature for each PCR amplified DISCI or DISC2 sequence analyzed, and to calculate 
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an appropriate buffer gradient for mutation detection. The buffers used for DHPLC analysis 
consisted of: Buffer A (0.1 M TEAA) and Buffer B (0.1 M TEAA and 25% acetonitrile). 

DNA Sequencing. DNA samples were sequenced using standard nucleic acid 
5 sequencing techniques discussed supra. 



Results 

1 0 PCR amplification products of the DISC 1 and DISC2 genomic sequences that 

contain exon (including the intron/exon junction), 5'-UTR, 3 f -UTR and regulatory (e.g. , 5'- 
promoter) sequences of the DISC 1 and/or DISC2 genes were generated from genetic samples 
obtained from individuals of the populations described in Example 1 , supra. The primers 
used are provided in FIGURE 4. The table in FIG. 4 describes primer sequence pairs 

15 (columns 3 and 4) for the identification/amplification of DISCI and/or DISC2 variants, as 
well as the location (column 5) and length (column 6) of the amplified sequence. The PCR 
primers were chosen to amplify DISCI and/or DISC2 sequences from about 150 to about 
450 bp in length, which are preferred size ranges for mutation analysis by the SSCP and 
DHPLC methods described here. 

20 The genetic samples included DNA samples obtained from individuals 

suffering from a neuropsychiatric disorder (core schizophrenia or schizoaffective disorder), 
as well as samples from control individuals who were not suffering from and did not exhibit 
symptoms of a neuropsychiatric disorder. 

The PCR amplification products were analyzed for polymorphisms using 

25 either SSCP, DHPLC or both. In the case of SSCP analysis, aliquots of PCR products 
amplified from the genomic DNA samples of appropriate individuals were heat denatured 
and electrophoresed in polyacrylamide gels, and variant nucleotides were detected by 
mobility shifts in the gel. Alternatively, DHPLC analysis used reverse-phase ion-pairing 
chromatography to detect heteroduplexes generated during amplification of PCR fragments 
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from individuals who are heterozygous at a particular nucleotide locus within the amplified 
fragment. Such heteroduplexes are detected by their differing retention times on the HPLC 
column. 

If variant nucleotides were detected by either SSCP or DHPLC, the remaining 
5 PCR product from the select individual(s) was sequenced to confirm and identify the 
polymorphism. 

TABLE 5A, below, summarizes SNPs identified in the DISCI genomic 
sequence described in Example 2, supra. In particular, column 3 (under the title "Residue 
no.") specifies the nucleotide residue in the DISCI genomic sequence, represented by the 

10 nucleotide sequences in TABLE 3, where each SNP is located. Column 4 (under the title 
"mutation") specifies the identity of the SNP. For example, the first SNP recited in the table 
(/. e. , discOla) is an SNP located at nucleotide residue number 1 872 of SEQ ID NO: 1 . This 
nucleotide is a cytosine (C) in the wild-type DISCI genomic sequence, whereas in the SNP, 
the nucleotide is mutated to a thymine (T), as indicated by "C/T" in column 4 of TABLE 5A. 

1 5 Column 5 (under the title "Genomic Region") specifies the region of the genomic sequence 
{e.g. , the specific exon, intron or 5 f -promoter region) where the SNP is located. TABLE 5B 
shows the polymorphic site and flanking sequences for each polymorphism, together with 
the corresponding genomic sequence (GI or SEQ ID NO.). 
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TABLE 5A 

SNPs Identified in DISCI Genomic Sequences 



Polymorphism 
ID 


GI No./ID 


Residue No. 


Mutation 
(WT/SNP) 


Genomic 
Region 


discOla 


9188229 


112716 


C/T 


Exon 9 


disc02a 


9588180 


37031 


G/A 


Intron 10 


disc03a 


9588180 


65613 


All 


Exon 1 1 


disc08a 


9213047 


1372 


G/A 


Exon 13 


disci 6a 


9213047 


3301 


C/T 


Exon 13 


discl8a 


9213047 


3794 


A/G 


Exon 13 


disci la 


9213047 


4555 


C/T 


Exon 13 


disc22a 


9213047 


4721 


G/T 


Exon 13 


disc22b 


9213047 


4832 


G/A 


Exon 13 


disc31a 


7239563 


133614 


C/G 


5'-promoter 


disc 43a 


zlhdisc43 
(SEQ ID NO: 4) 


176 


C/T 


Exon 6 
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TABLE 5B 






SNP Sequences 




Polymorphism ID 


Sequence 


SEQ ID NO: 


discOla 


GGCTAAAGACcTCACCGAGGA 


33 


disc02a 


AGGCTGGCCAgTTTCTCTAAT 


34 


disc03a 


GCTTATCCAGaGCCTACAGCT 


35 


disc08a 


GTCATTCTTGgGAATGTCTTC 


36 


disci 6a 


GTGTTCCCTAcGGCTTTATCT 


37 


disci 8a 


ACCACAAGCCaTAACTCATCT 


38 


disclla 


TGTGAGCCACcGCACCCGGCC 


39 


disc22a 


GCCCTTTACAgTCATATCCTA 


40 


disc22b 


CTGGTAAAATgTGAAGTAATA 


41 


disc31a 


CTG AG ATC CcG AGAC AGGCT 


42 


disc 43a 


AGAAAGAAATcGAAGCTCTCC 


43 



15 

Many of the SNPs identified are found in exons of the DISCI or DISC2 
genomic sequence. Thus, these SNPs may also generate an altered, transcribed gene product 
{e.g., an altered mRNA or an altered cDNA derived therefrom). These altered cDNA 
sequences are specified in TABLE 6, below, with respect to either the DISCI cDNA 
20 sequence (SEQ ID NO: 1) or the DISC2 cDNA sequence (SEQ ID NO:3) set forth in FIGS. 
1A-C and in FIGS. 3A-G, respectively. 
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TABLE 6A 



5 



SNPs Identified in 


DISCI cDNA Sequences 


Polymorphism 


SEQ ID NO. Residue No. 


Mutation 


ID 






(WT/SNP) 


discO la 


1 


1872 


it 


di^cO ?/7 

C4. 1 o C \J ^} L4 


1 


2163 


A /T 

A/1 


di^cORn 


1 


2879 


(jyA 


divr 1 f\n 

WfOu 1 (J Li 


1 


4816 


U I 


discl8a 


1 


5309 




disc21a 


1 


6069 


C/T 


disc22a 


1 


6236 


G/T 


disc22b 


1 


6347 


G/A 


disc43a 


1 


1460 


C/T 



Certain SNPs identified in TABLE 6A, above (i.e., disc08a, disci 6a, 
disc 18a, disc2 la, disc22a and disc22b) are located within an untranslated region (i. e. , the 5'- 
UTR or the 3'-UTR) of the DISCI or DISC2 cDNA sequence and are not expected, therefore, 

20 to affect the amino acid sequence of a polypeptide encoded by the cDNA. However, other 
SNPs identified in TABLE 6 A (i.e., discOla, disc02a, and disc43a) are located within the 
coding region of the indicated cDNA sequence and, further, change a codon of that coding 
sequence to one for a different amino acid residue. The disc43a mutation is silent, i. e. , the 
altered codon translates to the same amino acid as the wild-type codon. However, the cDNA 

25 sequences which comprise the discOla and disc02a SNPs do encode an altered gene product. 
Specifically, the polypeptides encoded by these SNPs comprise amino acid residue 
substitutions. The specific amino acid residue substitutions encoded by each of these SNPs 
are indicated in TABLE 6B, below. 
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TABLE 6B 

Amino Acid Substitutions Encoded by DISCI SNPs 









Mutation 


Polymorphism ID 


SEQ ID NO. 


Residue no. 


(WT/SNP) 


discOla 


2 


607 


Leu/Phe 


disc03a 


2 


704 


Ser/Cys 



Thus, for example, a DISC 1 nucleic acid containing the SNP discOla encodes 
an altered DISCI polypeptide. Specifically, this SNP encodes a variant of the polypeptide 

1 0 sequence set forth in SEQ ID NO:2 in which the amino acid residue at position 607 of this 
sequence is a phenylalanine (Phe) rather than a leucine (Leu). 

Nucleic acid molecules containing sequences which the SNPs disclosed here, 
as well as polypeptides encoded by such nucleic acids and containing the amino acid 
substitutions described above, are novel DISCI and DISC2 sequences of the present 

1 5 invention. Because these SNPs and amino acid substitutions are ones which correlate with 
the presence of a neuropsychiatric disorder (for example, schizophrenia, schizoaffective 
disorder, bipolar affective disorder, unipolar affective disorder and adolescent conduct 
disorder), these nucleic acids and polypeptides are particularly useful in the methods of the 
present invention, e.g., to diagnose and treat such disorders. 

20 

* * * 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the foregoing 
25 description and the accompanying figures. Such modifications are intended to fall within 
the scope of the appended claims. 

It is further to be understood that all values are approximate, and are provided 

for description. 
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Patents, patent applications, publications, product descriptions, and protocols 
are cited throughout this application, the disclosures of which are incorporated herein by 
reference in their entireties for all purposes. 
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